EN AR FA

VOX

One App. All AI Models. Every Voice.

4
AI Models
35+
Languages
$0
To Start

Built for Google Cloud + ElevenLabs Hackathon 2025

The Problem

AI is fragmented.
Voice is an afterthought.

ChatGPT

1 model, 1 voice, text-first

Claude

1 model, no voice

Gemini

1 model, limited voice

You can't use the best model for each task.
You can't personalize how AI sounds.
That's the gap.

The Solution

Your AI contacts.
Their voices.

  • Multi-model — Gemini, Claude, GPT-4, DeepSeek in one app
  • Voice cloning — Hear yourself speak any language
  • AI contacts — Each has unique voice, personality, purpose
  • Real-time translation — 35+ languages, your voice
Alice
Interview Coach • GPT-4
I have a Google interview next week. Help me prep?
Great! Let's start with a behavioral question. Tell me about a time you led a project through ambiguity...
Voice message (0:23)
Good structure! I'd rate that 7/10. Let's work on adding more specific metrics...
Timing

Why now?

Voice AI Got Good

ElevenLabs is now indistinguishable from human

Multi-Model Era

No single AI wins everything. Users want choice.

Gen Z is Voice-Native

They grew up with Siri. Voice notes > texting.

Costs Dropping

Voice AI is finally economically viable

2 years ago: Too early   →   Now: Window open   →   2 years later: Too late

Industry Validation

"Kids love Voice Mode"

Sam Altman
Sam Altman
CEO, OpenAI

"Kids love voice mode in ChatGPT."

"My kids will never be smarter than AI, but they will grow up vastly more capable than we grew up."

The Thomas the Train Story

A parent put ChatGPT in voice mode with his kid...

2h
Still talking
10K+
Words

"My son thinks ChatGPT is the coolest train-loving person. I can never compete."

Differentiation

What makes us different

1

Multi-Model

Use Claude for reasoning, Gemini for speed, GPT for coding. All in one interface.

2

Voice-Per-Contact

Alice's voice = interview mode. Carlos = Spanish mode. Voice triggers instant context.

3

Group AI Chat

Put Claude, GPT, Gemini in one chat. Watch them debate. Get the best answer.

No one else offers this combination.

Live Product

What we built

  • Multi-model AI (Gemini, Claude, GPT, DeepSeek)
  • Voice cloning (ElevenLabs integration)
  • Real-time translation (35+ languages)
  • Pre-made AI contacts (7 experts)
  • Custom contact creation
  • Subscription payments (Stripe)
  • Cloud sync across devices
Carlos
Spanish Tutor • Claude
¿Cómo estás hoy?
¡Muy bien! Tu pronunciación mejora cada día. Practiquemos el subjuntivo...
Marcus
Startup Mentor • Gemini
How should I price my SaaS?
Start with value-based pricing. What's the ROI for your customers?
Market

Market sizing

Bottom-Up Calculation

People who pay for AI tools ~50M globally
× Want voice-first experience ~10% = 5M
× Will try a new app ~20% = 1M
× Will convert to paid ~5% = 50K
× Average revenue per user $15/month
Year 1
$750K - $2M
ARR
Year 3
$20M - $50M
ARR
Revenue

How we make money

Free
$0

Basic model
Limited contacts
Default presets only

Max
$200

Unlimited everything
Priority access
Early features

To be calculated

ElevenLabs cost per character, LLM API costs, free tier sustainability. This is MVP — we built the product, now we validate the business.

Competition

Competitive landscape

Feature ChatGPT Claude Gemini Character.AI VOX
Multi-model access
Voice-per-contact
Voice cloning
Real-time translation Limited
Group AI chat Soon

Window: 12-18 months before big players catch up

Traction

What we need to prove

01

Voice-Per-Contact Aids Memory

Do users recall context faster with unique voices?

Validation: 100 user study
02

Multi-Model Matters

Do users actually switch between models?

Validation: Usage analytics
03

Users Will Pay

Free to paid conversion rate

Target: 5%+ conversion

These aren't assumptions. These are experiments we're running.

Team

Why us?

10+ years building products people actually use and return to

20M+
Downloads
50%
D1 Retention
#1
US App Store
10+
Years Experience
Published with Voodoo
100+ prototypes tested in US
Zero to launch, multiple times
Roadmap

What's next

Month 1-3

Validate

  • → 100 user hypothesis testing
  • → Calculate unit economics
  • → Voice cloning safety features
  • → Security audit
Month 4-6

Prove

  • → Launch Group AI Chat
  • → D7 retention > 25%
  • → Free→paid > 5%
  • → Optimize API costs
Month 7-12

Scale or Pivot

  • → If metrics work: Scale
  • → If metrics fail: Pivot
  • → B2B only if B2C proven
Investment

Raising $500K

12 months runway to reach 1,000 paying users

Use of Funds

Production $200K
AI & Voice API costs $150K
User acquisition $100K
Operations & legal $50K

Milestones

M3 Public launch, 100 paid users
M6 Group AI Chat, 500 paid users
M9 Mobile app, 750 paid users
M12 1,000 paid users, Series A ready

VOX

What we know

  • The product works
  • The technology exists
  • The timing is right

What we'll prove

  • Users retain and pay
  • Voice-per-contact works
  • Unit economics viable

We're not promising the next unicorn.
We're promising to find out if voice-first multi-model AI is a business.

If it is, we're first.

Try the Demo →