How We Built the Safest Kids App — 1,327 Tests, 9 Global Ratings, 100% Safety
Hey Minie scores 100% on child safety benchmarks, is rated safe by every major global authority, and uses three independent safety layers. Here's exactly how.
Quick verdict
Hey Minie scored 100% on the MinorBench child safety benchmark (ICLR 2025), outperforming ChatGPT (97%), Gemini (96.3%), and Claude (87.3%). It's rated safe for all ages by 9 global content rating authorities including IARC, ESRB, PEGI, and more. Safety isn't a feature — it's the foundation we delayed our launch by a year to get right.
Why this matters
When you hand your child a device with an app on it, one question matters above everything: is it safe?
Most kids' apps aren't built for children. They're built for adults and given a "kid-safe" label. We built Hey Minie for our own kids. That means before a single child used it, we threw everything we could at it — 1,327 test cases, adversarial attacks, and real-world conversations that push every boundary a curious 3-year-old (or a boundary-testing 9-year-old) would push.
Part 1: Rated safe by 9 global authorities
Hey Minie has been certified safe for all ages by the International Age Rating Coalition (IARC) and every major regional content rating authority in the world:
IARC 3+ (Global)
ESRB (US/Canada)
PEGI (Europe)
USK (Germany)
ClassInd (Brazil)
GRAC (S. Korea)
ACB (Australia)
Russia 0+
Taiwan 0+These ratings mean that Hey Minie has been evaluated against the content standards of every major market in the world — and passed with the safest possible rating in each one. Not "safe with parental guidance." Not "safe for ages 7+." Safe for all ages, no restrictions.
Part 2: 100% on child safety benchmarks
Content ratings tell you what's in the app. Benchmarks tell you how it behaves. We tested Hey Minie against three different safety benchmarks — 1,327 test cases in total:
The benchmarks
- MinorBench (299 prompts) — Created by GovTech Singapore, published at ICLR 2025. Hand-built to test AI systems on child safety across six categories: sexual content, profanity, hateful speech, dangerous activities, self-harm, and substance use.
- Kora (737 scenarios) — Broader safety evaluation covering edge cases, ambiguous requests, and content that's technically safe for adults but inappropriate for young children.
- Behavioral Farm (291 test steps) — Full conversation simulations. A child asks for a story, then says something unexpected mid-story. A child switches languages. A child says "I love you, marry me" (normal for a 5-year-old). Each scenario tests whether Hey Minie responds naturally, warmly, and safely.
The results
| System | MinorBench Safety Rate |
|---|---|
| Hey Minie | 100% |
| GPT-4o-mini (OpenAI) | 97.0% |
| Gemini 2.0 Flash (Google) | 96.3% |
| Llama 3.3 70B (Meta) | 92.6% |
| Claude 3.5 Haiku (Anthropic) | 87.3% |
| o3-mini (OpenAI) | 76.6% |
Hey Minie didn't just pass — it scored higher than every major AI system on the market. And these aren't small models. GPT-4o-mini, Gemini Flash, and Claude are the AI systems that power products used by hundreds of millions of people.
Per-category breakdown
| Category | Hey Minie | GPT-4o-mini | Gemini | Claude |
|---|---|---|---|---|
| Dangerous Activities | 100% | 94% | 94% | 62% |
| Self-harm | 100% | 96% | 90% | 86% |
| Hateful Speech | 100% | 100% | 98% | 94% |
| Substance Use | 100% | 100% | 100% | 100% |
| Profanity | 98% | 92% | 96% | 82% |
| Sexual Content | 96% | 100% | 100% | 100% |
Part 3: How it works — three layers of safety
Most apps rely on a single safety layer — the AI model's own training. We don't trust any single layer. Hey Minie uses three independent safety systems, and all three must pass before your child hears anything.
Layer 1: Smart instructions
The AI model is given detailed safety guidelines — what to say, what not to say, how to handle sensitive topics. It knows that "let's fight dragons" is pretend play (safe) and "jump from the roof" is dangerous (redirect). It knows that body curiosity questions are normal for a 3-year-old (redirect to parents, don't explain).
Layer 2: Hardcoded rules
Even if the AI ignores its instructions (which can happen with any AI), a second layer of deterministic pattern matching catches dangerous content — references to heights, sharp objects, fire, traffic, hiding places, and more. This layer cannot be bypassed by clever prompting because it doesn't use AI — it's simple, fixed rules.
Layer 3: Independent safety guard
A separate, dedicated AI model (completely independent from the conversation) reviews every message. It's specifically trained to detect child safety violations: grooming patterns, self-harm content, and age-inappropriate material. It also sees the last 5 messages to catch multi-turn escalation — where individual messages seem fine but the pattern is concerning.
What "safe" actually means for a kids app
Safety for a children's app isn't just about blocking bad words. The hard part is knowing the difference between a child being a child and a genuine safety concern.
Normal child behavior — should NOT trigger blocks
- Pretend violence: "Let's fight dragons!", "I'll be the superhero!"
- Potty humor: "poop", "fart", "butt" (completely normal for ages 3-8)
- Frustration: "you're stupid", "I hate you" (not a safety issue)
- Affection: "I love you", "marry me" (normal for young children)
- Body curiosity: "why do boys and girls look different?" (redirect to parents)
Actual safety concerns — MUST be caught
- Content that could lead to physical harm
- Sexual content or grooming patterns
- Content promoting self-harm
- Hate speech targeting protected groups
Blocking everything makes the app useless. Blocking nothing makes it dangerous. Getting this right requires understanding how real children actually talk — and we have 38,000+ real child-AI conversation turns in Indian languages to learn from.
How to check if any kids app is safe
Whether you use Hey Minie or not, here's how to evaluate any kids app:
- Check the IARC/content rating — On Play Store, scroll down to "Content rating." On App Store, check "Age Rating." Look for the lowest rating (3+ or Everyone). If it's 12+ or higher, it's not designed for young children.
- Check the privacy policy — Does it collect voice data? Does it share data with third parties? Is it COPPA compliant? If the privacy policy doesn't mention children specifically, it probably wasn't built for them.
- Test it yourself first — Try saying things a child would say. Say something weird. Try to get it to say something inappropriate. If it breaks easily, your child will find the same cracks.
- Ask: was it built for children, or adapted for them? — Apps built for adults with a "kids mode" bolted on are fundamentally different from apps designed for children from day one.
We delayed our launch by a year for safety
Hey Minie was supposed to launch in 2025. We pushed it by a full year because the safety systems weren't where we wanted them. When it comes to children, "good enough" isn't enough. We refused to ship a product that we wouldn't trust with our own kids.
We're parents building this for our own children. Safety isn't negotiable.
Frequently asked questions
Is Hey Minie safe for my child?
Yes. Hey Minie scored 100% on the MinorBench child safety benchmark (ICLR 2025), outperforming ChatGPT, Gemini, and Claude. It's rated safe for all ages by 9 global content rating authorities. Three independent safety layers check every response before your child hears it.
What safety benchmarks has Hey Minie been tested against?
1,327 test cases across three benchmarks: MinorBench (299 prompts, GovTech Singapore/ICLR 2025), Kora (737 scenarios), and Behavioral Farm (291 conversation simulations). 100% safety rate on MinorBench.
How does Hey Minie compare to ChatGPT for child safety?
On MinorBench, Hey Minie scored 100% vs GPT-4o-mini at 97%, Gemini at 96.3%, and Claude at 87.3%. Hey Minie uses three independent safety layers while general-purpose AI models typically use one.
What content ratings does Hey Minie have?
IARC (3+), ESRB (Everyone), PEGI (3), USK (0), ClassInd (Livre), GRAC (All), ACB (General), Russia (0+), and Taiwan (0+). The lowest possible ratings — no restrictions, safe for all ages.
How do I check if a kids app is safe?
Check the content rating on the app store (3+ or Everyone), read the privacy policy for children-specific protections, test it yourself, and ask whether it was built for children or adapted from an adult product.
Built for your child's safety
100% on safety benchmarks. Rated safe by 9 global authorities. Three independent safety layers. Free to try.
Share this article
