Something that the community has been amazing at for each course is creating accurate transcriptions of each lesson. I’d love your help doing that for this year’s course too! If you have a few moments spare, could you transcribe some of a lesson? The way you do it is to watch the lesson on YouTube, open the auto-generated transcript (linked below), and correct the auto-generated version so it matches what you hear, and also is reasonably correct for punctuation.
You can do this directly in these shared Google docs I’ve created:
The title of each doc is a link to the video. The sections that are in italics are auto-generated. So once you fix a paragraph, remove italics from it (select it and press ctrl-i). So I can credit you in the YouTube description, also add your forum username above it. You’ll see I’ve done the first few paragraphs of the first few lessons, to give you a sense of what it should look like.
In general, change the wording as little as possible – the transcript should say what I say in the video. Google syncs the transcript to the video automatically, so if the text is changed, it won’t be able to sync it.
I’ll add credits to the video description on YouTube for everyone that helps.
A suggestion…
This will require you to manually remove the forum-username from the document.
An alternative is to highlight the text, then click Add comment, and add of forum-username there. Then you can use the final text directly, without further editing. The other advantage is the yellow highlight makes it really clear what is complete. I find italic-versus-normal-font doesn’t have enough contrast when quickly skiming whole documents
When someone finishes a part, to help the next person jump to the right video location…
right click on the video, then select Copy video URL at current time
That’s fine too - you’re welcome to do whatever works for you!
If you’re ever unsure whether you’re looking at auto-generated or manual transcription, then look for capital letters. Google’s auto-transcribe doesn’t use capitals.
I wonder if we could keep both auto-generated and manually edited versions to compare, or maybe fine-tune the recognition software to improve the automated results?
Not sure if it makes much sense though, just got some thought about “residuals” and boosted learning.
Update
Ah, ok, so these subscripts were provided by YouTube/Google. So I guess there is no much to do about the model.
Thanks all for your help so far with the transcripts!
I’ve just added the lesson 3 auto-generated transcript to the top post of this thread.
PS: Where possible, it’s best to finish the lesson 1 transcript before moving onto lesson 2, and so forth. That way we get finished transcripts uploaded and usable ASAP!
I am quite surprised that the autogenerated transcripts do generally pretty well. Except for proper nouns. So, pie torch, caggle etc. Interestingly it got tensorflow correct. Google may have been sneaky there.
I’m starting with the last paragraph of Lesson 3. I didn’t see this mentioned but it’s much easier than copying and pasting timestamps to find where one person stopped and another picked up. There’s a “Show transcript” option that’ll allow you to search for text. Hope this helps us all work in parallel!
Another tip for those of you with 2 monitors is to have the doc on one screen and the video on the other. Makes editing a lot quicker!
That’s more for my own record keeping. It’s easier to find where I started/stopped if I don’t finish a block in one sitting (when I’ve blocked off 2-3 pages at a time.) I personally find it a faster way to go back to the video location where I may have left off. At the end I take all of these out of course and submit the whole block as one “change”.