Abacus.ai:

We recently released Smaug-72B-v0.1, which has taken first place on HuggingFace's Open LLM Leaderboard. It is the first open-source model to achieve an average score of more than 80.

  • cm0002@lemmy.world · 1 year ago

    Oh if only it were so simple lmao, you need ~130GB of VRAM, aka graphics card RAM. So you would need about nine consumer-grade 16GB graphics cards, and you'll probably need Nvidia because of fucking CUDA, so we're talking thousands of dollars. Probably approaching 10k.

    Ofc you can get cards with more VRAM per card, but not in the consumer segment, so even more $$$$$$ (rough memory math below).
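
    For anyone who wants to sanity-check those numbers, here's a minimal back-of-the-envelope sketch in Python, assuming 2 bytes per parameter at fp16 and roughly 20% extra for KV cache and activations (the precisions and overhead factor are assumptions, not official Smaug-72B figures):

    ```python
    # Rough memory estimate for a 72B-parameter model at different precisions.
    params = 72e9

    bytes_per_param = {
        "fp16/bf16": 2,    # full-precision-ish inference
        "int8": 1,         # 8-bit quantization
        "int4": 0.5,       # 4-bit quantization
    }

    for fmt, b in bytes_per_param.items():
        weights_gb = params * b / 1e9
        total_gb = weights_gb * 1.2  # assumed ~20% headroom for KV cache/activations
        print(f"{fmt}: ~{weights_gb:.0f} GB weights, ~{total_gb:.0f} GB total")
    ```

    At fp16 that lands around 144GB of weights alone, which is why quantized 4-bit variants are the only realistic way to squeeze a model like this onto consumer hardware.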

    • kakes@sh.itjust.works · 1 year ago

      Afaik you can substitute VRAM with regular RAM at the cost of speed; a sketch of how that offload is usually set up is below. Not exactly sure how the slowdown scales with the sheer size of these models, though. I have to imagine it would run insanely slow on a CPU.
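
      That RAM fallback is roughly what Hugging Face's device_map="auto" offloading does: it places as many layers as fit on the GPU and spills the rest to CPU RAM. A minimal sketch, assuming the abacusai/Smaug-72B-v0.1 repo on the Hub, one 16GB GPU, and accelerate installed (the memory caps are illustrative):

      ```python
      from transformers import AutoModelForCausalLM, AutoTokenizer

      model_id = "abacusai/Smaug-72B-v0.1"  # assumed Hub repo name

      tokenizer = AutoTokenizer.from_pretrained(model_id)
      model = AutoModelForCausalLM.from_pretrained(
          model_id,
          device_map="auto",  # place what fits on GPU, spill the rest to CPU RAM
          max_memory={0: "16GiB", "cpu": "128GiB"},  # illustrative: one 16GB GPU + system RAM
          torch_dtype="auto",
      )

      prompt = "Explain VRAM offloading in one sentence."
      inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
      output = model.generate(**inputs, max_new_tokens=32)
      print(tokenizer.decode(output[0], skip_special_tokens=True))
      ```

      Layers that land on the CPU run far slower than on the GPU, which is exactly the speed cost mentioned above.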

      • Infiltrated_ad8271@kbin.social · 1 year ago (edited)

        I tested it with a 16GB model and barely got 1 token per second (measured roughly as in the sketch below). I don't want to imagine what it would take if I used 16GB of swap instead, let alone 130GB.
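
        Tokens per second is just generated tokens divided by wall-clock time; a minimal sketch, where generate_fn is a hypothetical stand-in for whatever backend you're benchmarking:

        ```python
        import time

        def tokens_per_second(generate_fn, prompt, n_tokens=64):
            # generate_fn is a hypothetical stand-in for your backend's
            # generate call (llama.cpp, transformers .generate(), etc.)
            start = time.perf_counter()
            generate_fn(prompt, max_new_tokens=n_tokens)
            return n_tokens / (time.perf_counter() - start)

        # e.g. print(f"{tokens_per_second(my_generate, 'Hello'):.2f} tok/s")
        ```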