User avatar
autumnsauce @autumn@cafe.autumn.town
8mo
@halva ive done finetuning on BERT which is like 440 Million and that was 40 minutes on 10k lines of data on a 6700XT with 12gb of VRAM. its doable but slow. anything beyond that is gonna be rough yeah. i think its even funnier to have a 0.5 bil parameter shit model try to write out a sentence like me tho 😭