2024 06 Dissent
🧻 Our paper Gradient Dissent in Language Model Pretraining and Saturation got accepted into the ICML 2024 Workshop on High-dimensional learning dynamics!
🧻 Our paper Gradient Dissent in Language Model Pretraining and Saturation got accepted into the ICML 2024 Workshop on High-dimensional learning dynamics!