Last semester, as I spent untold hours editing the closed captioning automatically generated by YouTube for the math videos on my YouTube channel, I got a crash course on the capabilities and limitations of this system. This extra work was perhaps not legally necessary, but I took it on because a student with a hearing impairment was enrolled in my class, and I wanted to ensure that the review videos that I provide to my students were accessible to him as well.
I think the resources that my university offers to ensure that instructors can reach all students, and not just those without audio/visual impairments, are fairly typical. After discussions with the cognizant people at my university, I’ve reached a few conclusions:
- Mostly by accident, my videos are ADA compliant since I made the decision to both write out the solutions and talk through them.
- While the automatic closed-captioning provided by YouTube may be minimally compliant with ADA, I’m not sure that a student with a hearing impairment could always follow the transcriptions due to a number of errors.
- Aside from punctuation, capitalization, and the occasional homophone (e.g., right vs. write), YouTube does a pretty good job at transcribing ordinary speech.
- Naturally, YouTube’s automated closed-captioning is not to blame when I don’t enunciate clearly, follow a rabbit trail of thought and then have to backtrack, use poor grammar, make an outright mistake, etc.
- However, YouTube seems to have a lot of difficulty providing automatic closed-captioning of mathematical speech.
Fixing these transcription errors took an awful lot of time. I don’t want to know how many hours I devoted to fixing the 120 or so videos (each video is about 3-10 minutes long) recorded so that my hearing-impaired student could have full access to my class. About halfway into this project of fixing the closed-captioning errors, I started writing down some of the closed-captioning errors. I wish I had thought to do this near the start, but oh well.
Phonetically, I can understand why most of these errors were made. But these mistakes really shouldn’t have happened. Here are my favorite howlers that I recorded, showing both what I said and what YouTube thought I said.
- “931,147,496” became “930 1,000,000 147,000 496”
- “$A \cap C$,” pronounced “A intersect C,” became “A inner sexy”
- “arithmetic” became “rhythm sick”
- “capital $X$” became “Catholics”
- “cardinality” became “carnality”
- “divisible by 5” became “visited his wife live” (I have no idea how that happened)
- “$e^x$” became “eat ooh the x”
- “for succinctness” became “force the sickness”
- “$\binom{n}{n}$,” pronounced “n choose n,” became “and shoes and”
- “set containing” became “second taining”
- “[formula]” became “squirt tuna”
- “two ways in” became “too wasted”
- “what $f(3)$,” pronounced “what $f$ of 3,” became “whateva 3”
- “$x \in B$,” pronounced “x is in B,” became “sexism be”
- “$x \in A \cap C$,” pronounced “x is in A and C,” became “x is Indiana see”
- “$x \in C$,” pronounced “x is in C,” became “excellency”
Here’s the complete list of howlers that I recorded for posterity. If I’ve learned nothing else, it’s that I need to be more proactive about ensuring the mathematical accuracy of closed-captioning for my YouTube videos.
| What I said | What YouTube thought I said |
|---|---|
| 4 | for |
| 857 | a 50 7 |
| 1232 | 1230 two |
| 4761 | 4760 1 |
| 19,999 | 19,000 999 |
| 46,376 | 40 6376 |
| 123,552 | 120 3,552 |
| 5,565,120 | five million 565,000 120 |
| 931,147,496 | 930 1,000,000 147,000 496 |
| | 2d sent |
| | 28 |
| | one too |
| | 12 juice 4 |
| | 16 choosing |
| | surplus one mix for |
| | 4 2 0 |
| | four twos k |
| | 49 she’s 5 |
| | 52 six |
| | a choose to |
| | A inner sexy |
| | a intersecting |
| | a you be |
| | a UNC |
| | a you will see |
| a proof | approved |
| | a compliment |
| | asa by |
| all multiples of | almost visit |
| an element of | known the debate |
| an element of | normal today |
| and divisible | and as above |
| and positive 50 | + + 50 |
| and tens | intense |
| and would let this be 3 | andrew lippa p3 |
| arithmetic | earth to |
| arithmetic | rhythm sick |
| | ace |
| | be but not si |
| | b in a sexy |
| | beef |
| bijection | bi CH action |
| bijection | bite jection |
| bijection | by dejection |
| bijection | by ejection |
| bijection | by jection |
| bijection | by Junction |
| both sets | both says |
| capital X | Catholics |
| cardinality | carnality |
| Cartesian | car to shull |
| codomain | code Amin |
| coordinate | cordon |
| coordinate | court |
| coordinates | corners |
| coordinates have | cort in sap |
| cosine | cosign |
| disjoint | destroyed |
| divisible by 5 | visited his wife live |
| | eat ooh the x |
| element of A | illness of A |
| element of A | mellow today |
| element | that Windex |
| elements | of us |
| empty | MQ |
| | descent |
| | intercept |
| equal | able |
| exponent | x1 |
| factored | acted |
| factorial | fact welders |
| fill in | film |
| flipping four coins | philippine for coins |
| for succinctness | force the sickness |
| hence in | Hanson |
| | eye |
| | aye |
| If I divide by 15 | If I / 15 |
| in | nae |
| in there | a bear |
| infinite | if an |
| infinite | imp an |
| infinite | infant |
| into five | in 2 5 |
| | ice |
| | j choose arms |
| | cave |
| | kate |
| likewise | lakh wise |
| | and shoes and |
| | nth throw |
| one-to-one | 121 |
| onto | on 2 |
| | our shoes are |
| | art at |
| | already |
| | are too |
| | our too |
| | hours |
| same row | samro |
| second coordinate | sec cornered |
| set containing | second inning |
| set containing | second taining |
| set containing | seconds hanging |
| set containing | secretary |
| set containing 1 | second anyone |
| since | say has |
| sixth one | six-month |
| square | swear |
| | score 2 |
| | squirt of tuna |
| team A | teammate |
| term in it | terminate |
| than zero | gloves are off |
| that’s chosen | that’s Showzen |
| then | the next |
| therefore | there for |
| this entry in | the century plus |
| to the | decay |
| two are | to are |
| two ways in | too wasted |
| union | you need |
| up here | pier |
| what | whateva 3 |
| will be 4 | will before |
| with | finials 4 |
| would subtract | was attract |
| writing | riding |
| | extras |
| | exiting |
| | x as a native |
| | x is nay |
| | sexism be |
| | x is Indiana see |
| | excellency |
| | X’s and see |
| | next to |
| | text too |
| | export |
| | why |
| | wine |
| | wider |
| | wise |
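
One way I could be more proactive next time is to reuse a list like this one as a lookup table, so that errors I have already seen get a suggested fix before I edit the captions by hand. Below is a minimal sketch in Python, not something I actually used. The names `CORRECTIONS` and `suggest_fixes` are made up for illustration, the handful of entries are copied from the table above, and the code assumes the caption text is available as a plain string.

```python
import re

# Hypothetical lookup table: what YouTube produced -> what I actually said.
# These pairs come from the table above; a real table would be longer.
CORRECTIONS = {
    "rhythm sick": "arithmetic",
    "carnality": "cardinality",
    "force the sickness": "for succinctness",
    "second taining": "set containing",
    "by jection": "bijection",
    "code Amin": "codomain",
}


def suggest_fixes(caption, corrections=CORRECTIONS):
    """Replace known mis-transcriptions, matching whole phrases case-insensitively."""
    # Put longer phrases first so a short key never matches inside a longer one.
    keys = sorted(corrections, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b",
        flags=re.IGNORECASE,
    )
    # Lowercased keys let a match like "Rhythm Sick" find its correction.
    lookup = {k.lower(): v for k, v in corrections.items()}
    return pattern.sub(lambda m: lookup[m.group(1).lower()], caption)


if __name__ == "__main__":
    print(suggest_fixes("the carnality of the second taining 5"))
```

I would deliberately leave entries like “secretary,” “infant,” or “for” out of such a table, since those are real words that YouTube often gets right. Those still need a human reading the captions in context.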