Uncategorized

Model Compression and Efficient Inference for Large Language Models: A Survey



"Large Language Models"arXiv:2402.09748v1 Announce Type: cross Abstract: Transformer based large language models have achieved tremendous success.



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *