- The limitations of deep learning
- Implementing a Transformer From Scratch
- A ChatGPT Emacs shell
- Tensors and Convolution
- Transformer Deep Dive: Parameter Counting
- What Are Transformer Models and How Do They Work?
- Understanding LSTM Networks
- Some remarks on Large Language Models
- LLM Sandboxing: Early Lessons Learned
- Transformer Math 101
- GPT best practices
- Three techniques to adapt LLMs for any use case
- How does Machine Learning work?
- Large Language Models and Search
- What Is a Transformer Model?
- The self-supervised learning cookbook