Six steps to make your agent smarter using small local models. | Columbus .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

May 04, 2026 · Columbus

Six steps to make your agent smarter using small local models.

Learn a six-step process to train a small local model on your Mac Mini, making it as capable as larger models like OpenAI 5.4.

Overview
Tech stack
  • OpenAI
    OpenAI is an AI research and deployment company: We build safe artificial general intelligence (AGI) to benefit all of humanity.
    OpenAI is a premier AI research and deployment company, focused on developing safe Artificial General Intelligence (AGI) for global benefit. The organization operates under a unique structure: a non-profit Foundation governs a for-profit Group, which functions as a public benefit corporation. Its technology portfolio includes industry-defining models like the GPT series (e.g., GPT-4o, GPT-5.1), the conversational platform ChatGPT, and the text-to-video model Sora. These tools drive innovation across multiple sectors, providing powerful, accessible AI capabilities for developers, businesses, and consumers worldwide.
  • Qwin 2
    Qwen2 is Alibaba Cloud's next-generation series of open-source large language models featuring enhanced multilingual capabilities and state-of-the-art performance in coding and mathematics.
    Alibaba Cloud's Qwen2 series represents a major leap in open-source AI, offering five distinct model sizes ranging from 0.5B to 72B parameters. These models are pretrained on over 27 languages and utilize Group Query Attention (GQA) to optimize inference speed and memory efficiency. With a massive context window of up to 128K tokens, Qwen2 excels at processing complex datasets and long-form content. The flagship 72B-Instruct model consistently tops leaderboards (outperforming Llama-3-70B in key benchmarks) while maintaining strict alignment with human values through advanced post-training techniques.
  • A2A
    A2A (Agent2Agent) is the open communication protocol enabling secure, cross-platform collaboration between specialized AI agents.
    A2A, or Agent2Agent Protocol, is the open standard for AI agent interoperability, launched in April 2025 by Google and over 50 technology partners (e.g., Atlassian, SAP, MongoDB). It defines a standardized language for autonomous agents to discover capabilities, delegate tasks, and securely exchange information. The protocol leverages established web standards like HTTPS and JSON-RPC 2.0 to ensure enterprise-grade security and reliability. This framework breaks down silos, allowing a specialized fraud-detection agent, for example, to seamlessly coordinate with a customer-service agent, automating complex, multi-step workflows across diverse vendor ecosystems.
  • MLX-LM
    The Python package for efficient text generation and fine-tuning of Large Language Models (LLMs) directly on Apple silicon via the MLX framework.
    MLX-LM is a high-performance Python package engineered for text generation and fine-tuning of Large Language Models (LLMs) on Apple silicon, leveraging the core MLX array framework. It provides seamless integration with the Hugging Face Hub, allowing users to easily access and run thousands of LLMs with a single command. Key features include native support for 4-bit quantization to reduce model memory footprint and efficient low-rank or full model fine-tuning. This package enables developers to maximize the unified memory architecture of Apple silicon for faster, on-device machine learning workflows.
  • Mac Mini
    A 5-inch square powerhouse packing M4 or M4 Pro silicon and pro-grade I/O into a thermal-efficient desktop frame.
    Apple's latest Mac Mini redesign shrinks the footprint to a 5 by 5-inch chassis while doubling down on performance. The M4 Pro model supports up to 64GB of unified memory and 273GB/s bandwidth (ideal for heavy creative workflows or local LLMs). Connectivity is dense: three Thunderbolt 5 ports on the rear handle 120Gb/s transfers, while front-facing USB-C ports and a high-impedance headphone jack simplify daily peripherals. It drives up to three 6K displays simultaneously, making it the most efficient power-to-size ratio in the current Mac lineup.