Offline Language Translation Tool Capability Collaboration Event
11 June 2019
sofwerx.org/translator
Offline Language Translation Tool Capability Collaboration Event 11 - - PowerPoint PPT Presentation
Offline Language Translation Tool Capability Collaboration Event 11 June 2019 sofwerx.org/translator Blue Sky Working Group Brief Outcomes Team Votes Black 15 Blue 2 Purple 4 Red 1 Green 18 Orange 9 {Black} One-Year Product
sofwerx.org/translator
Team Votes Black 15 Blue 2 Purple 4 Red 1 Green 18 Orange 9
One-on-One, Two-Way Speech
1. Hardware
2. Software
Latency = 0.5s; CTR* = X 3. Language & Acoustics
Database population
4. Other
Surreptitious/Background (Future)
1. Hardware
2. Software
0.5s; CTR* = X
3. Language & Acoustics
population
4. Other
One-on-One, Two-Way Speech
1. Hardware
come out (refinements to software will only continue to increase computing load) 2. Software
MT BLEU > 25; Latency = 0.5s; CTR* = X 3. Language & Acoustics
testing/field data
more languages, or (B) increase sophistication on priority languages, or (C) pay a ton of $ and do both 4. Other
Surreptitious/Background
1. Hardware
come out (refinements to software will only continue to increase computing load) 2. Software
BLEU > 25; Latency = 0.5s; CTR* = X 3. Language & Acoustics
testing/field data
more languages, or (B) increase sophistication on priority languages, or (C) pay a ton of $ and do both 4. Other
▪ Select/Define Use Case ▪ Real time transcription/translation via audio ▪ Must address concerns for: a) Hardware Architecture– Form Fit Factor, Processing Ability a) Wireless ear piece: Near Field Magnetic Solution b) Handheld/Body - S7 ATAK/Galaxy Note 8 c) Centralized Processing Unit – KLAS VOYAGER; AWS Snowball/Outpost, etc. a) Bandwidth = 80MB (Trellisware/MANET) b) Computing power = 50 Watts
▪ Select/Define Use Case ▪ Real time transcription/translation via audio ▪ Must address concerns for: a) Software Architecture – AI Software Selection/Configuration w/HW a) Engine Selection for Transcription/Translation w/ appropriate mix of AI engines a) Via, Sage Maker, Veritone aiWARE for orchestration and algorithm management b) Identify key acoustic engines – open source & proprietary c) Identify key transcription/translation engines – open source & proprietary b) Language – Identify Set # of Languages to process for first year. a) Arabic, Spanish, Russian, Chinese c) UI/UX Definition – SW/HW a) Data Processing & Analysis, Training. b) In Theater/On Scene Interactions
▪ One on one, surreptitious/background/etc.? ▪ One on one – general collection/Ground Truth Classification/low noise ▪ Background Noise/Combination of Speakers – Preprocessing/ Adv Classification /De noise/Acoustic Eng
▪ Must address concerns for: a) Hardware – microphones –far field experience. Beam forming tech on consumer smartphones. Mini cloud. Finite languages (2 or 3?). USB for terabyte extension b) Software –multiple OS (android/iOS). Transcription tech to share. Data at rest encryption. Gather existing solutions. c) Acoustics– capture enough noise environments. Phonetics. d) Language- decide on metrics/product viability. Recording data for languages without existing data sources. e) Other – assess cognitive load of the Warfighter when using machine translation
▪ One on one at first
and software including Android/iOS using Google, Microsoft, iTranslate etc. Modern capable phone
increased (eg detect when interpreter is changing the message or missed something important) with machine translation.
languages with less data and begin improving them with acquisition of language data Stage 2: collect data from louder environments for key languages Stage 3: Measures of success – task completion %.
environments and background noise.
to improve the machine and assess accuracy of translations
language data
future
environment
dialects (local translators, language database via government, social media, television closed captioning)
common languages/dialects in region)
dialects
needs
Jetson AGX Xavier + Battery + storage Standalone unit for situations where other hardware is not available Microphone
foreign-language interlocutor
translate (in text)
radio/TV/etc. (including call-ins), which is being translated back to English
system can make its own choice from among those or others