Could you edit this post to link to the recording as well?
Youâll find the video here!
A few notes:
I wasnât sure how much to edit it for grammar, sense, and flow. I just used my judgement (Iâve worked as a writer and editor in various contexts). For instance where you said:
âBut you can see so if you train this for a few minutes, itâs nearly perfect.â
I changed to:
âBut you can see that if you train this for a few minutes, itâs nearly perfect.â
Itâs less conversational and has a little less of the implicit âHey help I need editing in order to read well!â Donât know if you want that or whether you want it left, so to speak, warts and all. I also added a couple of small screenshots where you were pointing to code, just so itâs more comprehensible and whole.
edit: based on @jeremyâs comments I removed the screenshots.
In the document preferences I turned off âautomatically capitalize wordsâ because itâs irritating to constantly have to (for instance) undo the automatically capitalized Untar into untar.
Iâm omitting the utterance âummâ ⌠assuming that is the right thing to do? i also ditto @quantumâs question as well.
Questions for Jeremy:
-
so this will be in Markdown. That means, if we are referring to code, we should put it in markdown format. Example:
import *
. Is this right? [bold should be bold, italics] -
I separated mine by paragraphs, makes it easier to read. Is that ok?
-
It seems like it would be a good idea if people retained their time stamp in the google doc, instead of deleting it after they have reviewed the transcript. Is this correct?
Iâm not confident that we should be removing the âumâsâ and the âlikeâsâ and the repeated words and the misspoken words.
From a machine learning perspective, if anyone were going to use these transcriptions as labeled training data, they would be very frustrated to find editorial license being taken.
From a human consumption perspective, I would say edit away!
Thanks for checking - best to keep the words exact as I said (but remove âummmâ and similar), so that YouTube can sync it to the video correctly.
No, just plain text please.
Yes.
I donât mind either way.
Yes leaving most stuff in is best, although âummâ and âlikeâ can be removed.
If itâs going to be syncâed to the video, maybe we could make a style guide for our use, so that the viewer gets a consistent experience.
Staying consistent with other editors, I used italics for function names. But some editors use italics and parentheses, some just italics. Sometimes entire code blocks are inserted.
Sometimes bolding is used â when should we bold?
For consistency, maybe we could make a glossary â for words like ResNet, Jupyter, fastai, Python, README âŚ
Itâs plain text in the end, so no formatting will end up in the final result.
Code blocks shouldnât be inserted - the transcription should only contain what I actually say.
A glossary could certainly be added to the top post if anyone would find that helpful - thanks!
Hi @jeremy,
sorry I was a bit late, the situation here is complicated (but getting better!). So, I actually kept all the âlikeâ and ârightâ (no âummmâ in my part ) but I can quickly remove them if you think that would help for YouTube syncing. Anyway, thereâs a piece at 51:12 which I couldnât parse no matter how much I tried: I highlighted the text, so could you please have a look into it?
Tip for transcribers: I found it useful to slow down the video and have the video window and the Google Doc window side by side. BTW, Google Docs is a great collaboration tool! Really good UX
No thatâs fine - I donât mind much either way
I had some extra time so I did some extra ones I hope you lot donât mind
Somehow we should make sure they are all in the same format. So they should all include âlikeâ âummâ or not⌠Same kind of punctuation. Capitalisation of words like Jupiter, Python or notâŚ
I am also not really sure about where to put commas. I would appreciate if someone could double check it. But of course comma style is also part of the syntax of the whole so this should be done after that is decided?
Looks like the video already has captions, and they are very close to being correct. Not sure how that happened âŚ
Hadus,
Glossary is a great idea! Things like Python, Jupyter, etc. need to be consistent.
I looked yours over ⌠biggest change was that I used the American (rather than UK) spelling (to remain consistent), and transitioned some semicolons to commas.
Auto generated by YouTube. I tried looking at them while transcribing but it didnât increase my productivity so I turned the captions off.
Also 0.75 playback speed on the video really helped.
How are we to deal with changing speakers? If this is going to be automatically syncâed to the video, not sure that it makes sense to include a name before the speaker (e.g., when dialog is ongoing between Jeremy and Rachel).
Should we just ignore changing speakers and simply include the audio text?
Yes please.
@transcribe-1v4: Just wanted to post a quick note to say - you folks are absolute heros! Thank you so much for your great work on this!
Happy to get stuck into lesson 2 transcription, is there a google doc available yet?