Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
March 19, 2024
·
Los Angeles
Self-Rewarding Language Models
This talk demonstrates reproducing the Self-Rewarding Language Model from MetaAI using open source models on accessible hardware, highlighting practical implementation.
Overview
We reproduced the Self-Rewarding Language Model paper from the team at MetaAI but with open source models
Links
Automates Self-Rewarding LLM reproduction via SFT, scoring, and DPO training.
Tech stack