icpp_pro (@icpp_pro)'s Twitter Profile
icpp_pro

@icpp_pro

I write C/C++ for the Internet Computer.

ID:1659672034320830464

Link: https://icgpt.icpp.world/ · Joined: 19-05-2023 21:27:27

235 Tweets

378 Followers

243 Following

icpp_pro (@icpp_pro):

So I need to convert my llama2.c models to llama.cpp, and found this incredible discussion between developers who already did it sometime last year.

This is why Open Source rules....

github.com/ggerganov/llam…
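The conversion discussed in that thread starts from the llama2.c checkpoint format, which a converter (such as llama.cpp's convert-llama2c-to-ggml example) must parse first. A minimal sketch, assuming Karpathy's original seven-field int32 header layout; the config values below are the stories15M-style toy numbers, used here only for the round-trip demo:

```python
import struct
import io

# llama2.c checkpoints begin with seven little-endian int32 config fields,
# followed by float32 weight tensors. Any converter parses this 28-byte
# header before touching the weights.
FIELDS = ("dim", "hidden_dim", "n_layers", "n_heads",
          "n_kv_heads", "vocab_size", "seq_len")

def read_config(f):
    """Parse the 28-byte llama2.c header into a config dict."""
    raw = f.read(7 * 4)
    return dict(zip(FIELDS, struct.unpack("<7i", raw)))

# Round-trip demo with toy config values.
buf = io.BytesIO(struct.pack("<7i", 288, 768, 6, 6, 6, 32000, 256))
cfg = read_config(buf)
print(cfg["dim"], cfg["vocab_size"])  # 288 32000
```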

icpp_pro (@icpp_pro):

Congrats also to DGDG (dgastonia).

This was the first time I used it, to buy my medallion, and the process was super smooth. Well done!

icpp_pro (@icpp_pro):

I uploaded all my trained models & tokenizers for on-chain llama2.c to HuggingFace.

I can't wait for the floating point update to be released on the mainnet of the Internet Computer, so I can try out the 42M and 110M parameter models that currently hit the instruction limit.

Note…

icpp_pro (@icpp_pro):

This past week, the DFINITY Scalability and Performance team took the ICGPT backend LLM for a spin in the new environment demonstrated earlier for image classification. The results came out wonderful: after saving billions of instructions, the generated output was identical.

Truly amazing work…

icpp_pro (@icpp_pro):

After hearing lastmjs ∞ describe the vision for Kybra during the ICP HUB townhall, I gave myself a stretch goal: don't just port llama.cpp to the IC, but also port the Python bindings, because Python is where most of the use is....

icpp_pro (@icpp_pro):

Today I tested a beta version of DeVinci, which lets you upload a PDF file from local disk into a vector database that runs side by side with the Llama3 LLM in the browser, all locally on your device.

The DeVinci dApp itself is served from the Internet Computer, fully…
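DeVinci's actual stack isn't shown in the tweet, but the retrieval step such a PDF-to-vector-database pipeline performs can be sketched generically: each document chunk is embedded as a vector, and a query vector is matched by cosine similarity. The chunk names and 3-d embeddings below are hypothetical toys; a real pipeline would use a sentence-embedding model:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Hypothetical embedded PDF chunks (toy 3-d vectors).
chunks = {
    "chunk-1: canister install steps": [0.9, 0.1, 0.0],
    "chunk-2: tokenizer details":      [0.1, 0.8, 0.2],
    "chunk-3: load balancing notes":   [0.0, 0.2, 0.9],
}

query_vec = [0.85, 0.15, 0.05]  # hypothetical embedding of a user question
best = max(chunks, key=lambda k: cosine(query_vec, chunks[k]))
print(best)  # chunk-1: canister install steps
```

The retrieved chunk would then be fed to the LLM as context, which is what lets the model answer questions about the uploaded PDF.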

Andrej Karpathy (@karpathy):

🔥llm.c update: Our single file of 2,000 ~clean lines of C/CUDA code now trains GPT-2 (124M) on GPU at speeds ~matching PyTorch (fp32, no flash attention)
github.com/karpathy/llm.c…

On my A100 I'm seeing 78ms/iter for llm.c and 80ms/iter for PyTorch. Keeping in mind this is fp32,…
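The quoted timings imply only a small per-iteration gap. A quick check of the arithmetic, using just the two numbers from the tweet (78 ms/iter for llm.c, 80 ms/iter for PyTorch fp32, no flash attention):

```python
# Implied throughput and relative speed from the quoted A100 timings.
llmc_ms, pytorch_ms = 78.0, 80.0

iters_per_sec_llmc = 1000.0 / llmc_ms
speedup = pytorch_ms / llmc_ms  # > 1 means llm.c is faster per step

print(f"llm.c: {iters_per_sec_llmc:.2f} iters/s, "
      f"{(speedup - 1) * 100:.1f}% faster per iteration")
```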

icpp_pro (@icpp_pro):

Big progress in scaling my LLM on the IC to multiple concurrent users, thanks to the scalability experts from DFINITY.

This is a great configuration for my case, using 4 LLMs per subnet. Check out how much faster inference is when you go from 1 to 4 LLMs behind a load-balancer…
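The tweet doesn't show how requests are distributed across the 4 LLM canisters; a simple scheme that would spread concurrent users evenly is round-robin. A toy sketch, with hypothetical canister names:

```python
import itertools

class RoundRobinBalancer:
    """Toy round-robin balancer over N identical LLM backends."""

    def __init__(self, canisters):
        self._cycle = itertools.cycle(canisters)

    def route(self, prompt):
        """Pick the next backend in rotation for this request."""
        return next(self._cycle)

# Hypothetical canister IDs standing in for the 4 LLMs on a subnet.
llms = [f"llm-canister-{i}" for i in range(4)]
balancer = RoundRobinBalancer(llms)

targets = [balancer.route(f"prompt {i}") for i in range(8)]
print(targets)  # each of the 4 canisters receives 2 of the 8 requests
```

With 4 backends instead of 1, each canister sees a quarter of the concurrent load, which is what shrinks per-request inference latency under contention.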
