Hackathon Portal
AI Tinkerers - Paris
Final round winners have been announced. View Results
Team

Cache me if you can

This team is at maximum capacity.

Project Concept

🧠 Project Overview

We’re building a mental-health oriented audio journal that becomes a live emotional companion powered by voice.

Today, millions of people talk to AI like they would to a friend – but most systems only analyze what you say (text), not how you say it (tone, rhythm, stress, pauses).
Our idea is to use ElevenLabs Conversational Agents + Audio Intelligence to create a companion that listens to you, talks back, and tracks your emotional state over time using prosody.


🎯 Goals

  1. Live voice conversations with an AI companion using ElevenLabs:
  • Natural, low-latency dialog
  • Turn-taking that feels like a real conversation
  1. Real-time prosody analysis of the user’s voice:
  • Detect variations in tone, intensity, speed, hesitations
  • Combine them with the transcript to estimate an emotional score
  1. Emotional timeline over days/weeks:
  • Each session contributes to a personal “emotional graph”
  • Users can see how their emotional state evolves over time
  • Always framed as self-care & awareness, not therapy

🔧 What We’re Building (Technically)

1. Voice Interface with ElevenLabs

  • Use ElevenLabs Conversational AI / Realtime API as:

    • The ears: streaming STT (transcription) and conversation events
    • The voice: TTS with expressive, controllable voices
  • The user speaks into a web or mobile client:

    • Audio is streamed to ElevenLabs
    • We receive live transcripts + timing info
    • Eleven’s TTS replies with a warm, adaptive voice

2. Prosody & Emotion Layer (Audio Intelligence)

  • In parallel, the same incoming audio is sent to our prosody service:

    • Extracts basic features (pitch, loudness, speech rate, pauses…)
    • Optionally uses an audio model/embedding for vocal emotion
  • We fuse:

    • How the person speaks (prosody features)
    • What they say (transcript, sentiment / intent)
  • This produces:

    • A real-time emotional index (e.g. 0–100)
    • A label like calm, stressed, sad, energized, etc.

This emotional index is:

  • Sent back to the agent as metadata to influence how the ElevenLabs voice should respond (more calm, more encouraging, more neutral).
  • Stored in the backend to update the user’s emotional timeline.

3. Emotional Journal & Dashboard

  • Every session is saved as:

    • Transcript (or key moments)
    • Emotional index over time (graph per session)
    • Notes / highlights (e.g. “big drop when talking about work”)
  • We provide a simple dashboard where users can:

    • See how their emotional index evolves across sessions
    • Spot recurring patterns (e.g. Sundays are always heavier, mornings vs evenings, etc.)
    • Optionally export anonymized stats

Entry

Status: Submitted

Last saved: November 15 at 6:12 PM CET

Team Roster (team is at max capacity)

Message board not available for this team yet.

Robin Quériaux Team Lead

Research Scientist at BNP Paribas
Led overall architecture, deployed the AI backend on Google Cloud using Vertex AI (Gemini for text understanding and reflection) and built the main frontend experience with Lovable, connecting it to ElevenLabs and the prosody/analytics services.
Research Scientist at BNP Paribas and Data & AI major at ECE Paris. Passionate about applied AI, human-centered technology, and ethical innovation. I build systems that connect research with real-world impact across finance, health, and education.
Agentic AI systems, emotion and prosody analysis, responsible AI governance, speech intelligence, blockchain for education credentials, and AI in mental health.
Currently leading an AI-powered emotional journal analyzing speech and prosody for well-being tracking; and La Certif, a blockchain-based credential platform for verifiable learning on Solana.

Arnaud Durand

Data scientist at BNP Paribas
Focused on the live voice output pipeline, wiring ElevenLabs Conversational / TTS APIs into the app and orchestrating the calls via n8n to handle streaming responses and voice style control.
Data Scientist @BNP Paribas | 42 Paris | I build full-stack AI & embedded systems (ESP32, Raspberry Pi, C/C++ + GenAI/RAG). Passionate about connecting hardware, data & intelligence. Always open to discuss new ideas, startups & collaboration opportunities!
Highly interested in the intersection of AI, health, and performance—both in tech and human physiology. Strong curiosity for applied machine learning, robotics, and embedded systems that bridge software with real-world impact. Also passionate about entrepreneurship, early-stage product building, and connecting with people working on innovative health-tech, AI applications, and hardware-driven projects.
Currently developing an embedded AI system for autonomous drones, focusing on real-time perception and on-edge decision-making. In parallel, building a web platform in the music space, exploring how data, user experience, and creative workflows can blend into meaningful digital tools.

Loan Perrard

Student at ECE Paris
Concentrated on the prosodic analysis of the user’s voice, designing the feature-extraction logic (pitch, intensity, speech rate, pauses) and integrating it into the emotional scoring pipeline that feeds the UI and agent.
Student at ECE Paris in 4th grade, I am participating in this hackathon to further develop my programming skills and abilities in vibe coding.
Application of Data Science especially in aerospace field
Fine-tuning LLMs to a better understanding of emotions through human voice.