A Systematic Comparison of Data Selection Criteria for SMT Domain Adaptation
Data selection has shown significant improvements in effective use of training data by extracting sentences from large general-domain corpora to adapt statistical machine translation (SMT) Car Audio Power Wiring systems to in-domain data.This paper performs an in-depth analysis of three different sentence selection techniques.The first one is cosin