AgentsIRL: Multimodal AI Hackathon with Google Cloud Run, Mistral, and NVIDIA GPUs
🧠 AgentsIRL: Multimodal AI Hackathon

**ft. Google Cloud Run, NVIDIA, and Mistral**
Build agents that see, hear, and think — then deploy them.
✨ Event Overview
Join us for AgentsIRL, a one-day technical sprint where AI builders will prototype real-world, multimodal agents that perceive and respond to audio-visual input — all deployed via Google Cloud Run, and accelerated with NVIDIA GPUs.

Developers love Cloud Run, Google Cloud’s serverless runtime, for its simplicity, flexibility, and scalability. Now, with NVIDIA L4 GPU support, Cloud Run offers a powerful environment for AI inference, well suited to running a wide array of lightweight open models. This includes popular open LLMs such as Gemma 3 1B/4B/12B, DeepSeek-R1 7B, and Llama 2 7B, as well as open foundation models like Stable Diffusion XL (SDXL), Whisper, YOLOv8, and SAM.
![A speaker presenting to a large audience at the AI Tinkerers London Ultimate Agents Hackathon](https://sloppy-joe-app.imgix.net/blog_images/hackathon-london-png-5d8N.png?w=1400&auto=format%2Ccompress&q=85)
Photo: Our Hackathon in London last week
☝ Register now. Space is limited. ☝
🧪 Hackathon Ideas:
Teams can build agents with strong real-world use cases.
Inspired by recent research directions in spatial reasoning and AV-LLMs (like SAVVY), this hackathon pushes teams to create agents that interpret real-world scenes and answer grounded, temporal questions such as:
“Where is the speaker located?”
“What caused the glass to fall?”
“Who just entered the room, and what did they say?”
Whether you’re working with Whisper, YOLOv8, or SAM, challenge yourself to fuse audio, video, and inference into live demos that can run anywhere, anytime, and at scale. Example directions:
- Interpret live AV feeds from webcams or mobile devices
- Analyze videos for events, causes, and speaker locations
- Simulate embodied intelligence for games, security, or accessibility
- Compare and analyze multiple images
- Build interactive e-commerce and retail experiences
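As a starting point for the audio-visual fusion ideas above, here is a minimal sketch in plain Python of how a question like “Who just entered the room, and what did they say?” might be approached. It assumes you already have timestamped speech segments (for example from a speech-to-text model such as Whisper) and timestamped, tracked detections (for example from an object detector such as YOLOv8 plus a tracker). The `SpeechSegment` and `Detection` types, the `track_id` field, and the 5-second window are all hypothetical choices for illustration, not part of any library or of the event's starter containers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeechSegment:
    start: float  # seconds from the start of the clip
    end: float
    text: str


@dataclass(frozen=True)
class Detection:
    time: float    # seconds from the start of the clip
    label: str     # e.g. "person"
    track_id: int  # hypothetical ID assigned by an upstream tracker


def first_appearances(detections, label="person"):
    """Map each track_id with the given label to the first time it was seen."""
    first = {}
    for det in sorted(detections, key=lambda d: d.time):
        if det.label == label and det.track_id not in first:
            first[det.track_id] = det.time
    return first


def speech_after_entry(detections, segments, window=5.0):
    """For each person track, collect speech starting within `window` seconds of entry.

    This is a purely temporal heuristic: it does not verify that the
    person who entered is the one speaking.
    """
    result = {}
    for track_id, t_enter in first_appearances(detections).items():
        result[track_id] = [
            seg.text
            for seg in segments
            if t_enter <= seg.start <= t_enter + window
        ]
    return result


# Made-up example data standing in for real model outputs.
detections = [
    Detection(0.5, "person", 1),
    Detection(1.0, "person", 1),
    Detection(12.0, "person", 2),
    Detection(12.5, "cup", 7),
]
segments = [
    SpeechSegment(1.2, 3.0, "Hi everyone."),
    SpeechSegment(13.0, 15.5, "Sorry I'm late."),
    SpeechSegment(30.0, 31.0, "Let's begin."),
]

entries = speech_after_entry(detections, segments)
print(entries)
```

A real agent would likely replace the time-window heuristic with speaker diarization or spatial audio cues, but aligning both modalities on a shared timeline is a reasonable first step.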
Bonus challenges for:
- Most elegant Cloud Run deployment
- Best use of GPU acceleration
- Most useful real-world agent (robotics, accessibility, spatial UI, etc.)
🗓 When & Where
🗓 Saturday, Oct 11 📍 In-person, Paris (venue on RSVP)
🧑‍💻 Limited slots, technical applications only
| Time | Activity |
|---|---|
| 8:00 AM | Doors open · Registration |
| 9:00 AM | Opening and Cloud Run Serverless GPU walkthrough |
| 10:00 AM | Team-formation lock-in session |
| 12:00 PM | Lunch |
| 1:00 PM – 6:00 PM | Mentoring available (sponsors & judges) |
| 6:00 PM | Dinner break |
| 7:15 PM | Project submission deadline · Judging starts (live demos from teams that make the final rounds) |
| 8:15 PM | Awards & closing ceremony |
| 9:00 PM | Doors close |
🔗 RSVP
Get early access to the full event brief, starter containers, and judge lineup.
This event will fill quickly — RSVP now.
📊 AI Tinkerers Paris Stats
- Attendees: This exclusive community of 4,082 subscribers comprises elite technical professionals, with 42% specializing in AI/ML research, 31% in full-stack software engineering, and 27% in technical leadership. Members possess deep expertise in PyTorch, LLMs, and agentic architectures. Notably, the group features prominent open-source maintainers from Hugging Face and graduates from top-tier institutions like École Polytechnique.
- Companies Represented: Tech giants like Google, Meta, and Microsoft; open-source and AI leaders such as Hugging Face, Mistral AI, LangWatch, and Black Forest Labs; and innovative startups like ElevenLabs, Docker, Datadog, Algolia, and Zoom, among others.
- Demos: Across 130 submissions, 83 demos have been presented at AI Tinkerers - Paris. The most exciting themes have focused on agentic production workflows, knowledge-graph-driven reasoning (GraphRAG/context graphs), and system architectures for scalable, reliable deployments. Technically, structured outputs and tool/function calling, cost/latency reductions, eval/benchmarking, and multimodal/local execution have been repeatedly explored.
What’s Next
If you’re planning to sponsor future events or expand participation, you can find sponsor opportunities at: https://paris.aitinkerers.org/sponsors

Photo: AI Tinkerers - Paris