How to convert dual alternating language audio clips to text using Python?












1















I am trying to convert some dual language audio clips to text. The clips start with English and then language changes to Deutsch and again it changes to English.



Below is my code:



import speech_recognition as sr

r = sr.Recognizer()
audio_file = sr.AudioFile('path_to_audio')

with audio_file as source:
audio_file_record = r.record(source)

print(r.recognize_google(audio_file_record, language='en-US'))
print(r.recognize_google(audio_file_record, language='de-DE'))


Problem is first print statement only prints the English audio part before the Deutsch and does not print English after the Deutsch.



How to get complete clip conversion with full text of both the languages?



I know i can set offset and duration to reach to specific part of a clip but then the solution will be specific to a clip which i do not want.










share|improve this question




















  • 1





    Can you get a timestamp for the start & end of the translated segment? If so, you could use that to parse the file in chunks.

    – SiHa
    Nov 16 '18 at 7:54













  • @SiHa, thanks for the idea. Not seen capturing timestamp as part of speech_recognition. Will try with timestamp before and after recognize_google call to see if it ends after first converted segment or waits till complete clip duration.

    – nAQ
    Nov 16 '18 at 16:46
















1















I am trying to convert some dual language audio clips to text. The clips start with English and then language changes to Deutsch and again it changes to English.



Below is my code:



import speech_recognition as sr

r = sr.Recognizer()
audio_file = sr.AudioFile('path_to_audio')

with audio_file as source:
audio_file_record = r.record(source)

print(r.recognize_google(audio_file_record, language='en-US'))
print(r.recognize_google(audio_file_record, language='de-DE'))


Problem is first print statement only prints the English audio part before the Deutsch and does not print English after the Deutsch.



How to get complete clip conversion with full text of both the languages?



I know i can set offset and duration to reach to specific part of a clip but then the solution will be specific to a clip which i do not want.










share|improve this question




















  • 1





    Can you get a timestamp for the start & end of the translated segment? If so, you could use that to parse the file in chunks.

    – SiHa
    Nov 16 '18 at 7:54













  • @SiHa, thanks for the idea. Not seen capturing timestamp as part of speech_recognition. Will try with timestamp before and after recognize_google call to see if it ends after first converted segment or waits till complete clip duration.

    – nAQ
    Nov 16 '18 at 16:46














1












1








1








I am trying to convert some dual language audio clips to text. The clips start with English and then language changes to Deutsch and again it changes to English.



Below is my code:



import speech_recognition as sr

r = sr.Recognizer()
audio_file = sr.AudioFile('path_to_audio')

with audio_file as source:
audio_file_record = r.record(source)

print(r.recognize_google(audio_file_record, language='en-US'))
print(r.recognize_google(audio_file_record, language='de-DE'))


Problem is first print statement only prints the English audio part before the Deutsch and does not print English after the Deutsch.



How to get complete clip conversion with full text of both the languages?



I know i can set offset and duration to reach to specific part of a clip but then the solution will be specific to a clip which i do not want.










share|improve this question
















I am trying to convert some dual language audio clips to text. The clips start with English and then language changes to Deutsch and again it changes to English.



Below is my code:



import speech_recognition as sr

r = sr.Recognizer()
audio_file = sr.AudioFile('path_to_audio')

with audio_file as source:
audio_file_record = r.record(source)

print(r.recognize_google(audio_file_record, language='en-US'))
print(r.recognize_google(audio_file_record, language='de-DE'))


Problem is first print statement only prints the English audio part before the Deutsch and does not print English after the Deutsch.



How to get complete clip conversion with full text of both the languages?



I know i can set offset and duration to reach to specific part of a clip but then the solution will be specific to a clip which i do not want.







python speech-recognition speech-to-text






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Nov 16 '18 at 7:19







nAQ

















asked Nov 14 '18 at 11:39









nAQnAQ

463610




463610








  • 1





    Can you get a timestamp for the start & end of the translated segment? If so, you could use that to parse the file in chunks.

    – SiHa
    Nov 16 '18 at 7:54













  • @SiHa, thanks for the idea. Not seen capturing timestamp as part of speech_recognition. Will try with timestamp before and after recognize_google call to see if it ends after first converted segment or waits till complete clip duration.

    – nAQ
    Nov 16 '18 at 16:46














  • 1





    Can you get a timestamp for the start & end of the translated segment? If so, you could use that to parse the file in chunks.

    – SiHa
    Nov 16 '18 at 7:54













  • @SiHa, thanks for the idea. Not seen capturing timestamp as part of speech_recognition. Will try with timestamp before and after recognize_google call to see if it ends after first converted segment or waits till complete clip duration.

    – nAQ
    Nov 16 '18 at 16:46








1




1





Can you get a timestamp for the start & end of the translated segment? If so, you could use that to parse the file in chunks.

– SiHa
Nov 16 '18 at 7:54







Can you get a timestamp for the start & end of the translated segment? If so, you could use that to parse the file in chunks.

– SiHa
Nov 16 '18 at 7:54















@SiHa, thanks for the idea. Not seen capturing timestamp as part of speech_recognition. Will try with timestamp before and after recognize_google call to see if it ends after first converted segment or waits till complete clip duration.

– nAQ
Nov 16 '18 at 16:46





@SiHa, thanks for the idea. Not seen capturing timestamp as part of speech_recognition. Will try with timestamp before and after recognize_google call to see if it ends after first converted segment or waits till complete clip duration.

– nAQ
Nov 16 '18 at 16:46












0






active

oldest

votes











Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53299392%2fhow-to-convert-dual-alternating-language-audio-clips-to-text-using-python%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes
















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53299392%2fhow-to-convert-dual-alternating-language-audio-clips-to-text-using-python%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

List item for chat from Array inside array React Native

Thiostrepton

Caerphilly