Computational Linguist (French) Text to Speech Are you Brave, Wise, Proud and ready to Exceed? Are you passionate, driven to reach goals and objectives, have excellent attention to details and have experience in a client focused environment? Covalen is a trusted outsourcing partner for leading global organisations. We're a diverse team of innovators and achievers – proud of our ability to consistently exceed goals.
Our client is a social networking company that operates on a worldwide level, engaging in the development of social media applications for people to connect through mobile devices, personal computers, and other surfaces.
THE ROLE As a Computational Linguist for Text to Speech, your role will be to work on improving TTS quality in the language of your expertise. These experts are needed to:
Make informed judgements of qualityImprove aspects of the pipeline (text normalization, pronunciation prediction)Pre-emptively identify problems specific to a new language, design test sets to illustrate these problems, and potentially help design solutions to those problems.DUTIES AND RESPONSIBILITIES Prior to Launch Create a regression test for each locale within their languageCreate and maintain a text normalization testset for their languageSource and vet datasets used in training of DD TN systems, and/or craft guidelines for external annotation programs used to generate those datasetsDevelop a set of text normalization rules for their language that guarantees certain accuracy against the testset (the TN rules are written in JavaScript)Create and maintain a pronunciation golden set for G2P evaluationIdentify/evaluate/solve language-specific pain-points, such as grammatical gender, word stress, segmentation, tone prediction, word case / declensionPerform targeted data quality checksAudio evaluationAfter Launch Fix all frontend bugs reported for the language via the methods in the Linguist Runbook, maintaining the regression test with each bugContinue improving text normalizationEnsure each deployment of the voice passes Capability Testing referenced in the Launch Review ProcessPerform ongoing Audio EvaluationCANDIDATE PROFILE Ideal candidate is a native or near-native speaker who majored/minored in linguistics, with some computational experience (or deep interest and willingness to learn).
Essential competencies needed for this role are:
Native or near-native (C1/C2) speaker of the market languageAdvanced/fluent (C1/C2) level of EnglishUndergraduate degree in linguistics or similarDemonstrated knowledge of International Phonetic AlphabetSome command line (Linux/Ubuntu) experienceOther competencies desirable for the role are:
Some Python experience preferredSome JavaScript experience preferredCOMPENSATION PACKAGE & BENEFITS Work from Home after training (first 2 months from the office in Dublin South)Performance bonusPrivate healthcarePension contributionTax Saver and Bike-to-Work SchemeFull training provided with career development opportunitiesBe part of a great, friendly, diverse teamWe can consider only applicants eligible to work full-time in Ireland. Thank you for understanding.
Keywords: Natural Language Processing, NLP, Large Language Models, LLM, Linguistics, Linux, Ubuntu, phonetics, International Phonetic Alphabet, artificial intelligence, AI
#J-18808-Ljbffr