Google Unveils TurboQuant Compression Algorithm, Claims Roughly 6x Memory Savings
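Google has introduced TurboQuant, a compression algorithm that could reduce the memory requirements of artificial intelligence systems. The technique is designed to shrink the memory footprint of large language models and vector search engines. It mainly targets the key-value cache (KV cache), which AI systems use to store frequently accessed information. As context windows grow, these caches are becoming a major memory bottleneck. TurboQuant can compress the key-value cache to 3-bit precision without retraining or fine-tuning the model, while leaving model accuracy largely unaffected. Tests on open-source models including Gemma show the technique can achieve roughly 6x compression of key-value cache memory. (Cailian Press)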
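To put the figures above in perspective, here is a back-of-envelope sketch of how key-value cache memory scales with context length and bits per stored value. The model shape (32 layers, 8 KV heads, head dimension 128, 32,768-token context) and the 16-bit baseline are assumptions chosen for illustration; they are not taken from the report and do not describe Gemma or any specific model. The function `kv_cache_bytes` is a hypothetical helper, and the calculation reflects only raw bit widths, not how TurboQuant itself works.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   bits_per_value, batch_size=1):
    """Approximate KV cache size in bytes.

    Counts one key vector and one value vector per layer, per KV head,
    per token, and ignores any per-block metadata (scales, codebooks).
    """
    stored_values = 2 * num_layers * num_kv_heads * head_dim * seq_len * batch_size
    return stored_values * bits_per_value / 8


# Hypothetical model shape, for illustration only.
shape = dict(num_layers=32, num_kv_heads=8, head_dim=128, seq_len=32768)

baseline = kv_cache_bytes(**shape, bits_per_value=16)   # assumed 16-bit cache
compressed = kv_cache_bytes(**shape, bits_per_value=3)  # 3-bit cache
ratio = baseline / compressed

gib = 1024 ** 3
print(f"16-bit cache: {baseline / gib:.2f} GiB")
print(f" 3-bit cache: {compressed / gib:.2f} GiB")
print(f"raw bit-width ratio: {ratio:.2f}x")
```

Under these assumptions the cache shrinks from 4 GiB to 0.75 GiB, a raw bit-width ratio of about 5.3x. The roughly 6x figure in the report comes from the reported tests and depends on the baseline format and measurement setup, so it need not match this simplified estimate exactly.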