TensorFlow reposted this
📣 🧠 Exciting news for researchers pushing the boundaries of efficient deep learning! We've scaled RecurrentGemma to 9 billion parameters! Dive into the details and access the model on: 📘 Kaggle → https://goo.gle/3xd1IYs 🤗 Hugging Face → https://goo.gle/3KDkknt 🚀 This new model achieves performance comparable to the largest Gemma 1 model, but with significantly greater efficiency. That means lower memory requirements and faster sampling speeds, especially for long sequences or large batch sizes. For example, on a single TPU-v4, it delivers 80x higher throughput when sampling 1k tokens from a 2k token prompt.