Writing & Engineering Notes

AI systems, shipped clearly.

Notes on agentic workflows, RAG systems, ML engineering, and building useful AI products. Deep-dives into agentic workflows, RAG, and applied ML.

12 entries found

April 22, 20265 min read
Fine-Tuning LLMs in 2026: How LoRA and QLoRA Deliver 95% of Full-Tune Performance with 10,000x Fewer Parameters

Updated guide to parameter-efficient fine-tuning, covering recent advances in low-rank adaptation, quantization, multi-task adaptation, and hardware-aware optimizations that make customizing large models accessible on consumer hardware.

Fine-tuningLoRA
April 22, 20266 min read
MetaComp StableX: The First AI Agent Governance Framework for Regulated Financial Services

Analysis of MetaComp's StableX Know Your Agent (KYA) Framework—a governance standard for AI agents operating in payments, compliance, and wealth management, launched at Money20/20 Asia.

AI GovernanceFinancial Services
April 22, 20265 min read
Microsoft's Agent Governance Toolkit: Runtime Security for Autonomous AI Agents

Deep dive into Microsoft's open‑source Agent Governance Toolkit—a hypervisor‑based framework that brings deterministic policy enforcement, zero‑trust identity, and execution sandboxing to autonomous AI agents.

AI SecurityAgent Governance
April 22, 20265 min read
Hugging Face ml‑intern: Automating LLM Post‑Training with an AI Agent

Deep dive into Hugging Face's ml‑intern—an open‑source AI agent that automates end‑to‑end LLM post‑training workflows, from literature review and data validation to fine‑tuning and deployment.

Hugging FaceML Automation
April 22, 20266 min read
OpenAI Agents SDK: A Lightweight Python Framework for Multi‑Agent Workflows

Deep dive into OpenAI's newly released Agents SDK—a lightweight, production‑ready Python framework for orchestrating multi‑agent workflows with built‑in tool‑calling, memory management, and real‑time streaming.

OpenAIAI Agents
April 21, 20269 min read
GenAI Processors v2.0: Google's Unified Framework for Modular, Streaming AI Pipelines

A technical deep dive into Google's open‑source library for building composable, asynchronous AI pipelines — with 2,108 stars, 212 forks, and a dual‑interface pattern that abstracts away streaming complexity.

GenAI ProcessorsGoogle Gemini