Ai for adobaro
Technology Stack
Chatgpt
ChatGPT is a sibling model to InstructGPT, which is trained to follow an instruction in a prompt and provide a detailed response.
Translation
Because our products are aimed at foreign creator, so the first thing is translation. Due to Chatgpt has excellent translation effort, we chose it as out translation engine. We use it to translate information about creator things and srt file. In addition to it we will also use Deepl as a backup solution.
Example
Text Only | |
---|---|
Content generation
In order to adapt content uploaded by creator to China, we will use specific prompt to generate relevant data using chatgpt.
Example
Whisper
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise and technical language. Moreover, it enables transcription in multiple languages, as well as translation from those languages into English. We are open-sourcing models and inference code to serve as a foundation for building useful applications and for further research on robust speech processing.
transcribe
Because the videos are all uploaded by foreign creator, compared to China, So making subtitles is a very difficult thing for them, So we introduced whisper. Whisper supports incoming parameters such as audio and model, it will automatically parse the source language and return the text with time series. We assemble legal srt files based on the returned json data using python. But there is an important step before that, which is translation, Because the main language used in China is Chinese, we need to use a translation engine to translate into Chinese before assembling