Alpay Ariyak (@alpayariyak) 's Twitter Profile
Alpay Ariyak

@alpayariyak

Post-Training Lead @ Together AI | 1/2 of @OpenChatDev (1.5M+ downloads, #1 7B LLM on Arena for 2+ months)

ID: 1682513195813027842

linkhttps://huggingface.co/openchat/openchat-3.5-0106 calendar_today21-07-2023 22:10:33

188 Tweet

1,1K Followers

2,2K Following

Alpay Ariyak (@alpayariyak) 's Twitter Profile Photo

Pleasantly surprised to see that our old Mistral AI 7B-based OpenChat model from January is the most popular generalist LLM fine-tune released by the open source on OpenRouter 🫶

Pleasantly surprised to see that our old <a href="/MistralAI/">Mistral AI</a> 7B-based OpenChat model from January is the most popular generalist LLM fine-tune released by the open source on <a href="/OpenRouterAI/">OpenRouter</a> 🫶
Alpay Ariyak (@alpayariyak) 's Twitter Profile Photo

Every day I wait patiently for them to drop the paper for the actual details on their new fine-tuning data generation pipeline… please WizardLM we need it

Alpay Ariyak (@alpayariyak) 's Twitter Profile Photo

Planning to build a centralized library for synthetic data generation for LLM training(sft first, then pt, rlhf, etc). Will allow you to build pipelines that combine and chain multiple data generation methods together, and modify them individually. Will be housing both

Alpay Ariyak (@alpayariyak) 's Twitter Profile Photo

New Orca paper, data generation pipeline looks quite promising for performance and quality in real world usage Microsoft is greedy with datasets, so it likely won’t be released, but please share the generation prompts at least, so the pipeline can be accurately reproduced 🙏🏼

Alpay Ariyak (@alpayariyak) 's Twitter Profile Photo

I have an exciting update - I’ve recently joined Together AI Together AI as a Research Scientist to lead Post-Training! Having the time of my life :)