Google Research Releases Compression Algorithm TurboQuant to Reduce AI Model Memory Usage

Google Research on Tuesday (24th) released TurboQuant, a training-free compression algorithm that can compress the key-value (KV) cache of large language models (LLMs) to 3 bits without affecting model accuracy, according to foreign media reports.

In benchmark tests on Nvidia (NVDA.US)'s H100 GPUs, 4-bit TurboQuant can increase the efficiency of computing attention logits by up to 8x compared with unquantized 32-bit keys, while reducing KV cache memory by at least 6x.
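
As a rough illustration of why bit width matters for KV cache size, the sketch below estimates the cache footprint of a single sequence at several bit widths. The model shape and the function name `kv_cache_bytes` are hypothetical and are not taken from TurboQuant; the estimate ignores any scales, codebooks, or other metadata a real quantizer stores, so it is not meant to reproduce the reported figures.

```python
def kv_cache_bytes(layers, kv_heads, head_dim, seq_len, bits):
    """Approximate size in bytes of the keys and values cached for one sequence."""
    # Two cached tensors (keys and values) per layer.
    elements = 2 * layers * kv_heads * head_dim * seq_len
    # Round up to whole bytes; quantizer metadata (scales, codebooks) is ignored.
    return (elements * bits + 7) // 8


# Hypothetical model shape, chosen only for illustration.
shape = dict(layers=32, kv_heads=8, head_dim=128, seq_len=32_768)

for bits in (32, 16, 4, 3):
    gib = kv_cache_bytes(bits=bits, **shape) / 2**30
    print(f"{bits:>2}-bit KV cache: {gib:.2f} GiB")
```

With this assumed shape, the estimate falls from 4 GiB at 16 bits to 0.75 GiB at 3 bits.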

Memory stocks Sandisk (SNDK.US) and Micron Technology (MU.US) fell 3.5% and 3.4%, respectively, overnight (25th).

Auto-translated by AI
This article was automatically translated by AI; the original-language version should be considered the authoritative version. AASTOCKS.com Limited does not guarantee its accuracy or completeness and accepts no liability for any damages or losses arising from the use of this translation.

AASTOCKS Financial News

Copyright(C) AASTOCKS.com Limited 2000. All rights reserved.
Disclaimer: AASTOCKS.com Ltd, HKEx Information Services Limited, its holding companies and/or any subsidiaries of such holding companies endeavour to ensure the accuracy and reliability of the Information provided but do not guarantee its accuracy or reliability and accept no liability (whether in tort or contract or otherwise) for any loss or damage arising from any inaccuracies or omissions.