Blog Series
Series
- Deepseek Papers (3)
- Five ML Concepts (29)
- General Technology (2)
- How AI Learns (7)
- Machine Learning (6)
- Multi-Hop Reasoning (2)
- Personal Software (5)
- Small Models, Big Brains (6)
- Throwback Thursday (5)
- Towards Continuous LLM Learning (2)
Multi-part blog post series, organized by topic.
Deepseek Papers
-
Part 1:
Deepseek Papers (1/3): mHC - Training Stability at Any Depth
760 words • 4 min read • Abstract
-
Part 2:
Deepseek Papers (2/3): Engram - Conditional Memory for Transformers
705 words • 4 min read • Abstract
-
Part 3:
Deepseek Papers (3/3): Engram Revisited - From Emulation to Implementation
1033 words • 6 min read • Abstract
Five ML Concepts
-
Part 1:
Five ML Concepts - #1
411 words • 3 min read • Abstract
-
Part 2:
Five ML Concepts - #2
446 words • 3 min read • Abstract
-
Part 3:
Five ML Concepts - #3
524 words • 3 min read • Abstract
-
Part 4:
Five ML Concepts - #4
453 words • 3 min read • Abstract
-
Part 5:
Five ML Concepts - #5
493 words • 3 min read • Abstract
-
Part 6:
Five ML Concepts - #6
491 words • 3 min read • Abstract
-
Part 7:
Five ML Concepts - #7
469 words • 3 min read • Abstract
-
Part 8:
Five ML Concepts - #8
477 words • 3 min read • Abstract
-
Part 9:
Five ML Concepts - #9
470 words • 3 min read • Abstract
-
Part 10:
Five ML Concepts - #10
499 words • 3 min read • Abstract
-
Part 11:
Five ML Concepts - #11
503 words • 3 min read • Abstract
-
Part 12:
Five ML Concepts - #12
488 words • 3 min read • Abstract
-
Part 13:
Five ML Concepts - #13
448 words • 3 min read • Abstract
-
Part 14:
Five ML Concepts - #14
448 words • 3 min read • Abstract
-
Part 15:
Five ML Concepts - #15
470 words • 3 min read • Abstract
-
Part 16:
Five ML Concepts - #16
468 words • 3 min read • Abstract
-
Part 17:
Five ML Concepts - #17
472 words • 3 min read • Abstract
-
Part 18:
Five ML Concepts - #18
444 words • 3 min read • Abstract
-
Part 19:
Five ML Concepts - #19
451 words • 3 min read • Abstract
-
Part 20:
Five ML Concepts - #20
456 words • 3 min read • Abstract
-
Part 21:
Five ML Concepts - #21
447 words • 3 min read • Abstract
-
Part 22:
Five ML Concepts - #22
472 words • 3 min read • Abstract
-
Part 23:
Five ML Concepts - #23
440 words • 3 min read • Abstract
-
Part 24:
Five ML Concepts - #24
426 words • 3 min read • Abstract
-
Part 25:
Five ML Concepts - #25
406 words • 3 min read • Abstract
-
Part 26:
Five ML Concepts - #26
424 words • 3 min read • Abstract
-
Part 27:
Five ML Concepts - #27
419 words • 3 min read • Abstract
-
Part 28:
Five ML Concepts - #28
443 words • 3 min read • Abstract
-
Part 29:
Five ML Concepts - #29
457 words • 3 min read • Abstract
General Technology
-
Part 1:
MCP: Teaching Claude to Play (and Trash Talk)
661 words • 4 min read • Abstract
-
Part 2:
JSON et al: A Deep Dive into Data Serialization Formats
2241 words • 12 min read • Abstract
How AI Learns
-
Part 1:
How AI Learns Part 1: The Many Meanings of Learning
592 words • 3 min read • Abstract
-
Part 2:
How AI Learns Part 2: Catastrophic Forgetting vs Context Rot
641 words • 4 min read • Abstract
-
Part 3:
How AI Learns Part 3: Weight-Based Learning
649 words • 4 min read • Abstract
-
Part 4:
How AI Learns Part 4: Memory-Based Learning
627 words • 4 min read • Abstract
-
Part 5:
How AI Learns Part 5: Context Engineering & Recursive Reasoning
631 words • 4 min read • Abstract
-
Part 6:
How AI Learns Part 6: Toward Continuous Learning
691 words • 4 min read • Abstract
-
Part 7:
How AI Learns Part 7: Designing a Continuous Learning Agent
894 words • 5 min read • Abstract
Machine Learning
-
Part 1:
Solving Sparse Rewards with Many Eyes
1473 words • 8 min read • Abstract
-
Part 2:
DyTopo: Dynamic Topology for Multi-Agent AI
781 words • 4 min read • Abstract
-
Part 3:
RLM: Recursive Language Models for Massive Context
995 words • 5 min read • Abstract
-
Part 4:
Neural-Net-RS: An Educational Neural Network Platform
1048 words • 6 min read • Abstract
-
Part 5:
In-Context Learning Revisited: From Mystery to Engineering
643 words • 4 min read • Abstract
-
Part 6:
Many-Eyes Learning: Intrinsic Rewards and Diversity
1393 words • 7 min read • Abstract
Multi-Hop Reasoning
-
Part 1:
Multi-Hop Reasoning (1/2): Training Wheels for Small LLMs
692 words • 4 min read • Abstract
-
Part 2:
Multi-Hop Reasoning (2/2): The Distribution Trap
796 words • 4 min read • Abstract
Personal Software
-
Part 1:
Cat Finder: Personal Software via Vibe Coding
914 words • 5 min read • Abstract
-
Part 2:
midi-cli-rs: Music Generation for AI Coding Agents
1063 words • 6 min read • Abstract
-
Part 3:
midi-cli-rs: Extending with Custom Mood Packs
1300 words • 7 min read • Abstract
-
Part 4:
music-pipe-rs: Unix Pipelines for MIDI Composition
1173 words • 6 min read • Abstract
-
Part 5:
music-pipe-rs: Web Demo and Multi-Instrument Arrangements
697 words • 4 min read • Abstract
Small Models, Big Brains
-
Part 1:
Small Models (1/6): 976 Parameters Beat Billions
703 words • 4 min read • Abstract
-
Part 2:
Small Models (2/6): AI in Your Pocket
765 words • 4 min read • Abstract
-
Part 3:
Small Models (3/6): Planner + Doer = Genius
789 words • 4 min read • Abstract
-
Part 4:
Small Models (4/6): This AI Has a Visible Brain
842 words • 5 min read • Abstract
-
Part 5:
Small Models (5/6): Max AI Per Watt
839 words • 5 min read • Abstract
-
Part 6:
Small Models (6/6): Which Small AI Fits YOUR Laptop?
985 words • 5 min read • Abstract
Throwback Thursday
-
Part 1:
TBT (1/?): My First Program Was a Horse Race
1138 words • 6 min read • Abstract
-
Part 2:
TBT (2/?): Pipelines on OS/390
1779 words • 9 min read • Abstract
-
Part 3:
TBT (3/?): Vector Graphics Games
1633 words • 9 min read • Abstract
-
Part 4:
TBT (4/?): ToonTalk - Teaching Robots to Program
1069 words • 6 min read • Abstract
-
Part 5:
TBT (5/?): IBM 1130 System Emulator - Experience 1960s Computing
1231 words • 7 min read • Abstract
Towards Continuous LLM Learning
-
Part 1:
Towards Continuous LLM Learning (1): Sleepy Coder - When Fine-Tuning Fails
1211 words • 7 min read • Abstract
-
Part 2:
Towards Continuous LLM Learning (2): Routing Prevents Forgetting
775 words • 4 min read • Abstract