Loadingโฆ
๐ Overall Model Ranking composite = BLEU 35% + quality 40% + speed 25%
Loadingโฆ
๐ Recent Samples
Loadingโฆ
Source lang:
Accepted/corrected eval decides quality. Shadow drift measures disagreement, not truth.
Loadingโฆ
Loadingโฆ
Loadingโฆ
โ
Accepted Eval Accuracy against verified/corrected transcript pairs
Loadingโฆ
๐งช Shadow Drift provider output vs current primary transcript
Loadingโฆ
๐ท๏ธ Live Routing Badges recent voice rows
Loadingโฆ
๐ Recent Shadow Samples redacted comparison cards
Loadingโฆ
Target lang:
Source lang:
๐ Translation Quality by Model higher = better
Loadingโฆ
View by:
Which model is best per language?
Loadingโฆ
Model A:
vs
Lang pair:
Select two models to compare
Language:
Target:
Show:
๐ Samples โ click a row to expand translations
Loadingโฆ
Loadingโฆ
๐ Cleanup Pass Breakdown why was the 2nd pass skipped or run?
Loadingโฆ
๐ By Language avg length, bullets, improvement per language
Loadingโฆ
Recent Entries
Loadingโฆ
Status:
Language:
Loadingโฆ
๐ฏ Production Transcription Quality live message quality_score rows
Scores here come from Retena production message heuristics. Use ASR Models โ Accepted Eval for provider accuracy against verified/corrected transcript pairs.
Loadingโฆ
๐ How Production Quality Scoring Works
Each production transcription is scored 0โ100 using fast heuristics (no LLM). The score reflects how likely the transcript is accurate and natural.
โ
Positive Signals
โข +10 Word count โฅ 5
โข +10 Avg word length 3โ10 chars
โข +5 Has punctuation
โข +5 No looping patterns
โข +10 Realistic speech rate (0.3โ3.0 sec/word)
โข +10 Avg word length 3โ10 chars
โข +5 Has punctuation
โข +5 No looping patterns
โข +10 Realistic speech rate (0.3โ3.0 sec/word)
โ Negative Signals
โข โ20 Word count < 3
โข โ15 Avg word length < 2 or > 15
โข โ20 Repeated substrings (hallucination)
โข โ30 Contains [BLANK_AUDIO] / [INAUDIBLE]
โข โ15 Impossibly fast (< 0.1 sec/word)
โข โ10 Impossibly slow (> 10 sec/word)
โข โ15 Avg word length < 2 or > 15
โข โ20 Repeated substrings (hallucination)
โข โ30 Contains [BLANK_AUDIO] / [INAUDIBLE]
โข โ15 Impossibly fast (< 0.1 sec/word)
โข โ10 Impossibly slow (> 10 sec/word)
80โ100 Excellent
60โ79 Good
40โ59 Review
0โ39 Poor
Base score: 60 ยท Clamped 0โ100 ยท Pure heuristics, no LLM cost
๐ Production Quality by Language
Loadingโฆ
๐ Scored Production Messages
Loadingโฆ