HPLT (@hplt_eu) 's Twitter Profile
HPLT

@hplt_eu

Horizon Europe - High Performance Language Technology (HPLT)

ID: 1542506822573051910

linkhttp://hplt-project.org calendar_today30-06-2022 13:54:35

48 Tweet

234 Followers

15 Following

LTG Oslo (@ltgoslo) 's Twitter Profile Photo

It's snowing large language models this week in Norway! 1st, the 5th NLPL and HPLT Winter School on LLMs is ongoing now in Skeikampen And 2nd, the LTG has released three fully open generative language models for Norwegian, based on Mistral and BLOOM architectures #NLProc

It's snowing large language models this week in Norway!  
1st, the 5th NLPL and <a href="/hplt_eu/">HPLT</a> Winter School on LLMs is ongoing now in Skeikampen
And 2nd, the LTG has released three fully open generative language models for Norwegian, based on Mistral and BLOOM architectures
#NLProc
Konstantin Dobler (@konstantdobler) 's Twitter Profile Photo

Attending the HPLT & NLPL Winter School in Skeikampen, Norway was a blast and highly recommended if you are interested in Large Language Models. Bonus: we had our very own take on building a snowman!

Attending the <a href="/hplt_eu/">HPLT</a> &amp; NLPL Winter School in Skeikampen, Norway was a blast and highly recommended if you are interested in Large Language Models. Bonus: we had our very own take on building a snowman!
HPLT (@hplt_eu) 's Twitter Profile Photo

HPLT scientific highlights directly from one of the authors : "LTG-BERT: an efficient LM architecture developed within the HPLT project, won the BabyLM challenge. Look forward to our future release of 75 LTG-BERTs for 75 different languages." Thanks David Samuel and congrats!

HPLT scientific highlights directly from one of the authors : 
"LTG-BERT: an efficient LM architecture developed within the HPLT project, won the BabyLM challenge. Look forward to our future release of 75 LTG-BERTs for 75 different languages." 
Thanks <a href="/davidsamuelcz/">David Samuel</a> and congrats!
HPLT (@hplt_eu) 's Twitter Profile Photo

HPLT partners have explored the effects of multililingual vs monolingual instruction tuning, under a constrained budget and a self-made machine-translated Alpaca-based dataset. Spoiler: go multilingual! Work will be presented at #eacl2024 Findings: arxiv.org/abs/2309.08958

HPLT (@hplt_eu) 's Twitter Profile Photo

We will be presenting the HPLT datasets HOW-TO and insights at LREC COLING 2024 in Torino. Paper already in Arxiv.org: arxiv.org/pdf/2403.14009….

We will be presenting the HPLT datasets HOW-TO and insights at <a href="/LrecColing/">LREC COLING 2024</a> in Torino. Paper already in Arxiv.org: arxiv.org/pdf/2403.14009….
HPLT (@hplt_eu) 's Twitter Profile Photo

Good news for small languages and LLMs. Paper on open Poro 34B model shows how training on Finnish, English and programming languages creates a very strong Finnish model, that excels in translation and is competitive in generating English and code: arxiv.org/abs/2404.01856

HPLT (@hplt_eu) 's Twitter Profile Photo

Will you be at LREC COLING 2024 next week? HPLT will! 🥳 Don't miss: - our poster on Thursday 23, 15:30, about FastSpell, one of the langID technologies of our dataset pipeline. (paper 1571) - our presentation on Friday 24, 9:20 for all details about HPLT massive dataset (paper 2199)

Silicon Vikings 🇩🇰🇪🇪🇫🇮🇮🇸🇱🇻🇱🇹🇳🇴🇸🇪 (@siliconvikings) 's Twitter Profile Photo

Europe’s largest private #AI lab #helyes Silo AI, #tribetampere Turun yliopisto - University of Turku’s research group TurkuNLP + HPLT released the 1st multilingual large language model (LLM) for all Nordic languages + English + programming languages. By Tech.eu tech.eu/2024/05/15/sil… #NordicMade

Ona de Gibert (@onadegibert) 's Twitter Profile Photo

LREC COLING 2024 has arrived! We will presenting our work on how we built the HPLT datasets! 📅 Friday 24th of May ⏰ 9.20h-9.40h 📍Room Londra ⌛️Session D3-S1-R3 - Multilinguality, Machine Translation, and Translation Aids II

Ona de Gibert (@onadegibert) 's Twitter Profile Photo

Still recovering from the excitement of #lreccoling2024, where we presented the HPLT resources! We introduce: - monoHPLT: monolingual collection covering 75 languages - biHPLT: parallel data for 18 language pairs - multiHPLT: synthetic data obtained through pivoting

Still recovering from the excitement of #lreccoling2024, where we presented the <a href="/hplt_eu/">HPLT</a> resources! We introduce:
- monoHPLT:  monolingual collection covering 75 languages
- biHPLT: parallel data for 18 language pairs
- multiHPLT: synthetic data obtained through pivoting
Institute of Formal and Applied Linguistics (@ufal_cuni) 's Twitter Profile Photo

📢 Job offer: Work with us! 🤓 Institute of Formal and Applied Linguistics Matematicko-fyzikální fakulta Univerzity Karlovy is looking for 🖥️ a Front-End and ⌨️ a Back-End Java developer to work on 🇪🇺 European Open Science Cloud. More details are at ufal.mff.cuni.cz/jobs. The application deadline is 🗓️ Aug 28.

Institute of Formal and Applied Linguistics (@ufal_cuni) 's Twitter Profile Photo

The MT Marathon continues on its third day! We already had great talks by Ondrej Bojar, Raj Dabre, Vilém Zouhar, and Elizabeth Salesky 👏 and a poster session with 10 posters 🖼️. Today, we continue with more talks, and of course, the week-long hackathon continues with interesting projects.

The MT Marathon continues on its third day! We already had great talks by <a href="/OndrejBojar/">Ondrej Bojar</a>, <a href="/prajdabre1/">Raj Dabre</a>, <a href="/zouharvi/">Vilém Zouhar</a>, and <a href="/esalesk/">Elizabeth Salesky</a> 👏 and a poster session with 10 posters 🖼️. Today, we continue with more talks, and of course, the week-long hackathon continues with interesting projects.