Multi-part blog post series, organized by topic.

Deepseek Papers

  • Part 1: Deepseek Papers (1/3): mHC - Training Stability at Any Depth

    760 words4 min readAbstract

  • Part 2: Deepseek Papers (2/3): Engram - Conditional Memory for Transformers

    705 words4 min readAbstract

  • Part 3: Deepseek Papers (3/3): Engram Revisited - From Emulation to Implementation

    1033 words6 min readAbstract

Five ML Concepts

  • Part 1: Five ML Concepts - #1

    411 words3 min readAbstract

  • Part 2: Five ML Concepts - #2

    446 words3 min readAbstract

  • Part 3: Five ML Concepts - #3

    524 words3 min readAbstract

  • Part 4: Five ML Concepts - #4

    453 words3 min readAbstract

  • Part 5: Five ML Concepts - #5

    493 words3 min readAbstract

  • Part 6: Five ML Concepts - #6

    491 words3 min readAbstract

  • Part 7: Five ML Concepts - #7

    469 words3 min readAbstract

  • Part 8: Five ML Concepts - #8

    477 words3 min readAbstract

  • Part 9: Five ML Concepts - #9

    470 words3 min readAbstract

  • Part 10: Five ML Concepts - #10

    499 words3 min readAbstract

  • Part 11: Five ML Concepts - #11

    503 words3 min readAbstract

  • Part 12: Five ML Concepts - #12

    488 words3 min readAbstract

  • Part 13: Five ML Concepts - #13

    448 words3 min readAbstract

  • Part 14: Five ML Concepts - #14

    448 words3 min readAbstract

  • Part 15: Five ML Concepts - #15

    470 words3 min readAbstract

  • Part 16: Five ML Concepts - #16

    468 words3 min readAbstract

  • Part 17: Five ML Concepts - #17

    472 words3 min readAbstract

  • Part 18: Five ML Concepts - #18

    444 words3 min readAbstract

  • Part 19: Five ML Concepts - #19

    451 words3 min readAbstract

  • Part 20: Five ML Concepts - #20

    456 words3 min readAbstract

  • Part 21: Five ML Concepts - #21

    447 words3 min readAbstract

  • Part 22: Five ML Concepts - #22

    472 words3 min readAbstract

  • Part 23: Five ML Concepts - #23

    440 words3 min readAbstract

  • Part 24: Five ML Concepts - #24

    426 words3 min readAbstract

  • Part 25: Five ML Concepts - #25

    406 words3 min readAbstract

  • Part 26: Five ML Concepts - #26

    424 words3 min readAbstract

  • Part 27: Five ML Concepts - #27

    419 words3 min readAbstract

  • Part 28: Five ML Concepts - #28

    443 words3 min readAbstract

  • Part 29: Five ML Concepts - #29

    457 words3 min readAbstract

General Technology

  • Part 1: MCP: Teaching Claude to Play (and Trash Talk)

    661 words4 min readAbstract

  • Part 2: JSON et al: A Deep Dive into Data Serialization Formats

    2241 words12 min readAbstract

How AI Learns

  • Part 1: How AI Learns Part 1: The Many Meanings of Learning

    592 words3 min readAbstract

  • Part 2: How AI Learns Part 2: Catastrophic Forgetting vs Context Rot

    641 words4 min readAbstract

  • Part 3: How AI Learns Part 3: Weight-Based Learning

    649 words4 min readAbstract

  • Part 4: How AI Learns Part 4: Memory-Based Learning

    627 words4 min readAbstract

  • Part 5: How AI Learns Part 5: Context Engineering & Recursive Reasoning

    631 words4 min readAbstract

  • Part 6: How AI Learns Part 6: Toward Continuous Learning

    691 words4 min readAbstract

  • Part 7: How AI Learns Part 7: Designing a Continuous Learning Agent

    894 words5 min readAbstract

Machine Learning

  • Part 1: Solving Sparse Rewards with Many Eyes

    1473 words8 min readAbstract

  • Part 2: DyTopo: Dynamic Topology for Multi-Agent AI

    781 words4 min readAbstract

  • Part 3: RLM: Recursive Language Models for Massive Context

    995 words5 min readAbstract

  • Part 4: Neural-Net-RS: An Educational Neural Network Platform

    1048 words6 min readAbstract

  • Part 5: In-Context Learning Revisited: From Mystery to Engineering

    643 words4 min readAbstract

  • Part 6: Many-Eyes Learning: Intrinsic Rewards and Diversity

    1393 words7 min readAbstract

Multi-Hop Reasoning

  • Part 1: Multi-Hop Reasoning (1/2): Training Wheels for Small LLMs

    692 words4 min readAbstract

  • Part 2: Multi-Hop Reasoning (2/2): The Distribution Trap

    796 words4 min readAbstract

Personal Software

  • Part 1: Cat Finder: Personal Software via Vibe Coding

    914 words5 min readAbstract

  • Part 2: midi-cli-rs: Music Generation for AI Coding Agents

    1063 words6 min readAbstract

  • Part 3: midi-cli-rs: Extending with Custom Mood Packs

    1300 words7 min readAbstract

  • Part 4: music-pipe-rs: Unix Pipelines for MIDI Composition

    1173 words6 min readAbstract

  • Part 5: music-pipe-rs: Web Demo and Multi-Instrument Arrangements

    697 words4 min readAbstract

Small Models, Big Brains

  • Part 1: Small Models (1/6): 976 Parameters Beat Billions

    703 words4 min readAbstract

  • Part 2: Small Models (2/6): AI in Your Pocket

    765 words4 min readAbstract

  • Part 3: Small Models (3/6): Planner + Doer = Genius

    789 words4 min readAbstract

  • Part 4: Small Models (4/6): This AI Has a Visible Brain

    842 words5 min readAbstract

  • Part 5: Small Models (5/6): Max AI Per Watt

    839 words5 min readAbstract

  • Part 6: Small Models (6/6): Which Small AI Fits YOUR Laptop?

    985 words5 min readAbstract

Throwback Thursday

  • Part 1: TBT (1/?): My First Program Was a Horse Race

    1138 words6 min readAbstract

  • Part 2: TBT (2/?): Pipelines on OS/390

    1779 words9 min readAbstract

  • Part 3: TBT (3/?): Vector Graphics Games

    1633 words9 min readAbstract

  • Part 4: TBT (4/?): ToonTalk - Teaching Robots to Program

    1069 words6 min readAbstract

  • Part 5: TBT (5/?): IBM 1130 System Emulator - Experience 1960s Computing

    1231 words7 min readAbstract

Towards Continuous LLM Learning

  • Part 1: Towards Continuous LLM Learning (1): Sleepy Coder - When Fine-Tuning Fails

    1211 words7 min readAbstract

  • Part 2: Towards Continuous LLM Learning (2): Routing Prevents Forgetting

    775 words4 min readAbstract