Hi. This is my response to the Community Participant from Landmark College who contributed this method for obtaining transcripts from the downloaded str files of the CCs generated by Canvas Studio. Thank you for recognizing this need, and for the need for an efficient way to do this. I haven't tried out your method yet, but I hope it will be effective. I also found a 3rd party app, which I also haven't tried, but which is a Chrome add-on and promises to accomplish this same function. And it was suggested to me that if I am recording my own lecture via an audio recording feature on my smartphone, that I would be better off using a speech-to-text app which would bypass the need to generate CCs. I originally had the idea of running an audio recording through Studio, generating CCs, downloading them, then editing that file into a wrapped text for a readable transcript, which is much more efficient timewise to process than an audio lecture. I did initially try to copy paste the srt file into a Word document, then do a global search and replace function for each digit successively (i.e., 0-1-2, etc.). That eliminated all the numerals, but not the symbols or the punctuation that remained as vestiges of the time stamps. I believe there is a way to remove hard breaks in this search and replace function, and I was able to copy and paste the symbols (horizontal lines of some sort) and eliminate those, but I was reluctant to eliminate the punctuation marks, because that would also eliminate them in the CC text, and they are needed there. Anyway, I wonder how TED.com does it, because they seem to generate readable transcripts. I have also had my ESL students copy and paste transcripts of their own words, as represented by the CCs, from their own video or audio recordings, into the website tophonetics.com, where they can obtain a rendering of their text in IPA (International Phonetic Alphabet), as well as read aloud by an automated native English-speaking voice reader. This is quite empowering and instructive to them, because they have generated their own words from their own thoughts, rather than imitating words produced by others. Once they hear their own words read by a native English speaker, they can adjust their own speech to attain more clarity. Also, I have been looking for a way to produce CCs from a CC generator which does not have a semantic or syntatic processor, which could reproduce more accurately their produced speech on a phonologically processed level alone. This would provide them with visual feedback to help them perceive where their English speech production lacks clarity. I initially thought of this method, and sometimes it does work, but the CC processors now apparently incorporate multiple levels of linguistic processing (i.e., syntactic and semantic) which compensates for lack of clarity at the phonological level. Where could I find CC generators which reproduce the phonological level alone, so that speakers could get visual feedback to help them become more aware of phonological production clarity issues in their own speech?
https://subtitletools.com/convert-subtitles-to-plain-text-online
This discussion post is outdated and has been archived. Please use the Community question forums and official documentation for the most current and accurate information.