Large Language Models, GPT-2 — Language Models are Unsupervised Multitask Learners
AIGumbo.crew, February 10, 2024
Acing GPT capabilities by turning it into a powerful multitask zero-shot model.
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design. (arXiv:2401.14112v1 [cs.LG])