BigCode(@BigCodeProject) 's Twitter Profileg
BigCode

@BigCodeProject

Open and responsible research and development of large language models for code. #BigCodeProject run by @huggingface + @ServiceNowRSRCH

ID:1554445522664148993

linkhttp://www.bigcode-project.org calendar_today02-08-2022 12:34:57

251 Tweets

8,8K Followers

3 Following

Rishabh Srivastava(@rishdotblog) 's Twitter Profile Photo

Important things I forgot to mention in the original tweet!

- SQLCoder is fine-tuned on StarCoder, an awesome initiative of the BigCode

- We used a slightly novel training approach. We first trained the model on 'easy' questions, and then trained the result of that on…

account_circle
Rishabh Srivastava(@rishdotblog) 's Twitter Profile Photo

We just open-sourced SQL Coder, a 15B param text-to-SQL model that outperforms OpenAI's gpt-3.5! When fine-tuned on an individual schema, it outperforms gpt-4. github.com/defog-ai/sqlco…

The model is small enough to run on a single A100 40GB in 16 bit floats, or on a single…

We just open-sourced SQL Coder, a 15B param text-to-SQL model that outperforms OpenAI's gpt-3.5! When fine-tuned on an individual schema, it outperforms gpt-4. github.com/defog-ai/sqlco… The model is small enough to run on a single A100 40GB in 16 bit floats, or on a single…
account_circle
Omar Sanseviero(@osanseviero) 's Twitter Profile Photo

Other interesting things

- BigCode and Argilla have the most downloaded datasets
- Open Assistant and Together AI are among the orgs with the most liked datasets
- Sentence Transformers keeps up with the most downloads

account_circle
Arjun Guha(@ArjunGuha) 's Twitter Profile Photo

LLMs are great at programming tasks... for Python and other very popular PLs. But, they are often unimpressive at artisanal PLs, like OCaml or Racket. We've come up with a way to significantly boost LLM performance of on low-resource languages. If you care about them, read on!

account_circle
Loubna Ben Allal(@LoubnaBenAllal1) 's Twitter Profile Photo

In the past two weeks, we've seen 4 new code models drop: StableCode, OctoCoder, OctoGeeX, and DeciCoder 🚀

Everyone's talking about 'HumanEval' – so how does code evaluation work & what makes reproducibility challenging?

A thread 🧵:

In the past two weeks, we've seen 4 new code models drop: StableCode, OctoCoder, OctoGeeX, and DeciCoder 🚀 Everyone's talking about 'HumanEval' – so how does code evaluation work & what makes reproducibility challenging? A thread 🧵:
account_circle
Loubna Ben Allal(@LoubnaBenAllal1) 's Twitter Profile Photo

Next Tuesday, I will give a webinar hosted by Analytics Vidhya on the training of LLMs for code, like StarCoder.

I will also discuss how to leverage these models using open-source libraries such as transformers, datasets and PEFT.

Register here: bit.ly/47BSnqK.

Next Tuesday, I will give a webinar hosted by @AnalyticsVidhya on the training of LLMs for code, like StarCoder. I will also discuss how to leverage these models using open-source libraries such as transformers, datasets and PEFT. Register here: bit.ly/47BSnqK.
account_circle
Niklas Muennighoff(@Muennighoff) 's Twitter Profile Photo

How to instruction tune Code LLMs w/o data? Releasing

🐙🤖OctoCoder & OctoGeeX: 46.2 on HumanEval🌟SoTA🌟of commercial LLMs
🐙📚CommitPack: 4TB of Git Commits
🐙🎒HumanEvalPack: HumanEval extended to 3 tasks & 6 lang

📜arxiv.org/abs/2308.07124
💻github.com/bigcode-projec…
1/9

How to instruction tune Code LLMs w/o #GPT4 data? Releasing 🐙🤖OctoCoder & OctoGeeX: 46.2 on HumanEval🌟SoTA🌟of commercial LLMs 🐙📚CommitPack: 4TB of Git Commits 🐙🎒HumanEvalPack: HumanEval extended to 3 tasks & 6 lang 📜arxiv.org/abs/2308.07124 💻github.com/bigcode-projec… 1/9
account_circle
Vlad Lialin(@guitaricet) 's Twitter Profile Photo

For all PhD students in small labs: find all possible ways to collaborate with well-known open research groups like EleutherAI LAION BigScience Research Workshop BigCode; apply to every single fellowship and look for connections. It’s not optional if you want to have a career.

account_circle
Leandro von Werra(@lvwerra) 's Twitter Profile Photo

The power of building tools, datasets, and models in the open: the community can build on top of it and everyone profits!

Exhibit A: since the release of 📑The Stack and ⭐️StarCoder research groups from academia and industry have trained models on top BigCode's releases.

The power of building tools, datasets, and models in the open: the community can build on top of it and everyone profits! Exhibit A: since the release of 📑The Stack and ⭐️StarCoder research groups from academia and industry have trained models on top BigCode's releases.
account_circle
Loubna Ben Allal(@LoubnaBenAllal1) 's Twitter Profile Photo

Inspired by the Open LLM LeaderBoard, and with several strong code models released, we created a Multilingual Code Leaderboard:

📊 10+ programming languages
⚡Throughput measurement
🔬 Fully reproducible
✉️ Open for submission of results

huggingface.co/spaces/bigcode…

Inspired by the Open LLM LeaderBoard, and with several strong code models released, we created a Multilingual Code Leaderboard: 📊 10+ programming languages ⚡Throughput measurement 🔬 Fully reproducible ✉️ Open for submission of results huggingface.co/spaces/bigcode…
account_circle
WizardLM(@WizardLM_AI) 's Twitter Profile Photo

WizardCoder attains the 🏆1st position on BigCode Multilingual Code Models Evaluation Leaderboard, which compares performance of base multilingual code generation models on HumanEval benchmark and MultiPL-E.

👉huggingface.co/spaces/bigcode…

account_circle
ServiceNow Research(@ServiceNowRSRCH) 's Twitter Profile Photo

Please join us in congratulating the BigCode community for the latest batch of 1B, 3B, 7B open-access StarCoder models for code-generation (and more), released today under a commercial use OpenRAIL-M license. Incredible work!

Join the collaboration here: bigcode-project.org/docs/about/joi…

account_circle
Xenova(@xenovacom) 's Twitter Profile Photo

AI code completion running 100% locally inside your browser, thanks to BigCode's StarCoder models and 🤗 Transformers.js!

We also got their new 1B model running at ~20 tokens per second in Node.js (CPU). 🚀

Check out the demo! huggingface.co/spaces/Xenova/…

account_circle
AndriyMulyar(@andriy_mulyar) 's Twitter Profile Photo

Starcoder 3B runs on CPU ⚡

Excited to launch Hugging Face's Starcoder model in gpt4all.io on CPU!

Local code models will be everywhere

account_circle