pietro / projects

merlo 2026

merlo

Inference on SmolLM2-135M on 2 Raspberry Pi Zero W to hit 3 tokens/s.

kevin-32b 2025

blog, paper

Model for trained with multi-turn reinforcement learning to write fast CUDA kernels.

civetta-and-passero 2025

civetta, passero

civetta implements everything you need for supervised training (autograd, dataloaders, optimizers, …).

passero implements an env template and various classical RL algorithms.

swe-grep 2025

blog, hn

Trained model with reinforcement learning to do fast (sub 5s) codebase-wide context agentic context retrieval.

spization 2024

code

Library for working with Series Parallel Structures. For more info on what series parallel structures are, see here.