Identifying and Removing Dead Neurons in Training Neural Networks: Mechanistic Interpretability Part 2Dec 30, 2024·9 min read
Visualizing the Activation, Gradients, Gradient to data ratio, and Update to Data Distributions in Neural Networks Training.In my last three blog posts, we explained about how to reduce the wrong but high confidence of neural network, how to identify dead neurons, and The Kaiming initialization and batch normalization. The core of these three blog post is, the correct ini...Jan 21, 2025·21 min read
How to correctly Initialize the Neural Network: Mechanistic Interpretability Part 1Dec 20, 2024·9 min read
How basic concept of Calculus (Derivative) has a Key Role in Training Neural Networks?A Neural Network is a machine learning model that works like the human brain. Just like how our brain has neurons that send signals, a neural network has artificial neurons that pass information. The network takes input, does some math, and passes th...Sep 4, 2024·9 min read