bAbI v2 AI Music Detection Model — Research Deep Dive

Published May 22, 2026 · 12 min read · AI Song Checker team

In May 2026, letssubmit publicly released bAbI v2, one of the few openly documented AI music detection models. The model card claims 87.67% accuracy on held-out test data — much lower than competing proprietary detectors but with full transparency. This article digs into the architecture, what it gets right, and where it falls short.

What is bAbI v2?

bAbI (Bayesian Authenticity-Based Inference, version 2) is a single-model neural network trained on a labeled dataset of AI-generated vs human-made music tracks. It outputs a probability score 0-1.

Architecture (per published model card)

Strengths

Limitations

Architectural comparison

SystemApproachAccuracyTrade-off
bAbI v2 (letssubmit)Single neural network87.67%Simple, transparent, lower accuracy
AI Song Checker v8.3Bayesian fusion of 82+ hand-crafted forensic signals99.1%High accuracy, interpretable signals, weekly recalibration
authioEnsemble of 12 specialized models99.42%Highest claimed accuracy, slower inference, opaque
IRCAM AmplifyResearch-grade CNN + spectral analysis~99%Enterprise-only, no public API

Why hand-crafted signals beat black-box NN here

For AI music detection specifically, hand-crafted forensic signals (MFCC, CPP, phase coherence, codec residuals) outperform black-box neural networks because:

  1. The signal types are physically grounded (we know why they work)
  2. They generalize to unseen engine versions (signatures persist across versions)
  3. They're interpretable (you can tell users which signal triggered the verdict)
  4. They recalibrate faster (no retraining needed, just signal weighting)

Future of public AI detection models

bAbI v2's main contribution is normative — it sets a precedent for transparency. We expect more research labs to publish similar model cards through 2027, especially as the EU AI Act requires "labeling AI content" mandates auditable detection methods.

For developers

If you're building your own AI music detector for research, bAbI v2 is a solid starting baseline. For production use cases (DSP, distributor, A&R platform), an ensemble or Bayesian approach (like ours) will outperform single-model architectures.

Related