Gradient Bang: LLM-Native MMO Game
See a demo of Gradient Bang, an LLM-native massively multiplayer online game that grew from sample code into an active community of players and contributors.
Kwindla builds real-time voice and AI infrastructure at Daily and created the open-source Pipecat framework. He’ll be demoing Gradient Bang, a massively multiplayer online game built from the ground up around LLMs that started as sample code and broke containment into an active community of players and contributors.
- Pipecat: An open-source Python framework for building low-latency, real-time voice and multimodal AI bots that see, hear, and speak. It addresses the coordination problem of integrating services such as speech recognition (Deepgram), large language models (OpenAI GPT), and speech synthesis (Cartesia) into a single pipeline. This pipeline architecture keeps latency low, with typical voice interactions completing in 500-800 ms, so conversations feel natural and responsive. Use cases range from phone agents and voice assistants to multimodal apps that combine audio, video, and text.
- LLM: Large language models (LLMs) are deep learning models, built on the Transformer architecture, that process and generate text and code at scale. They are a class of foundation models: large pre-trained neural networks (often with billions to trillions of parameters) that use the self-attention mechanism of the Transformer architecture (introduced in 2017) to predict the next token in a sequence. Trained on vast datasets (e.g., Common Crawl's 50 billion+ web pages), models like GPT-4, Gemini, and Claude acquire predictive power over syntax and semantics. They function as general-purpose sequence models, enabling applications such as content generation, language translation, and automated code completion (e.g., GitHub Copilot). Their core value is generalizing across diverse tasks with minimal task-specific fine-tuning.
- Gradient Bang: An LLM-native massively multiplayer online game, built from the ground up around LLMs. It started as sample code and grew into an active community of players and contributors.
- Daily: The developer platform for real-time video and audio experiences. Daily provides the infrastructure for developers to build high-performance video calls and interactive broadcasts into any application. The platform handles the heavy lifting of WebRTC (global mesh networks, bandwidth management, and device compatibility) via a robust API and pre-built UI components. From 1:1 telemedicine consultations to massive 100,000-person interactive events, Daily delivers sub-250ms latency and 1080p resolution. Engineering teams at companies like Teamflow and Hotjar use Daily's SDKs to ship production-ready video features in days rather than months.
- Real-time voice: GPT-4o delivers low-latency, human-like vocal interaction. It responds to audio input in as little as 232 milliseconds, with an average of about 320 milliseconds, which is close to human conversational pace. The system bypasses the traditional three-step pipeline (transcription, processing, synthesis) by using a single neural network to handle audio input and output directly. The result is a fluid interface that picks up tonal shifts, handles interruptions, and expresses a range of emotions from laughter to singing. Developers use these capabilities via the Realtime API to build applications for customer support, language tutoring, and accessibility tools.
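
The three-step voice pipeline mentioned in the entries above (speech recognition, then an LLM, then speech synthesis) can be illustrated with a minimal Python sketch. This is not Pipecat's API: the `Stage` class, the `run_pipeline` function, the stand-in lambdas, and the per-stage latency figures are all hypothetical placeholders, chosen only so the total lands inside the 500-800 ms range quoted for a typical voice interaction.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Stage:
    """One step in a voice pipeline (hypothetical; not a Pipecat class)."""
    name: str
    run: Callable[[str], str]  # transforms the payload and passes it on
    latency_ms: int            # assumed latency, for illustration only


def run_pipeline(stages: List[Stage], payload: str) -> Tuple[str, int]:
    """Pass the payload through each stage in order and sum the latencies."""
    total_ms = 0
    for stage in stages:
        payload = stage.run(payload)
        total_ms += stage.latency_ms
    return payload, total_ms


# Stand-ins for real services; the latency numbers are made-up placeholders.
stages = [
    Stage("speech_to_text", lambda audio: audio.replace("audio:", "text:"), 200),
    Stage("llm", lambda text: text + " -> reply", 350),
    Stage("text_to_speech", lambda text: text.replace("text:", "audio:"), 150),
]

result, total_ms = run_pipeline(stages, "audio:hello")
print(result, total_ms)
```

Because the stages run one after another, their latencies add up, which is why each stage has to be fast for the whole exchange to feel conversational. A single-network approach like the one described for GPT-4o avoids this by removing the hand-offs between stages.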