From Transformers to Mamba: Is Attention All We Need?
Less than 1 minute read. Published: June 01, 2024

A Comparison of Probabilistic and Statistic View of Deep Learning Models
8 minute read. Published: March 01, 2024

Polynomials: the Good (theory), the Bad (practice), and the Ugly (from theory to practice)
4 minute read. Published: February 16, 2024