ERC Starting Grant: Domain Adaptation for Statistical Machine Translation (DASMT)
Rapid translation between European languages is a cornerstone of good governance in the EU, and of great academic and commercial interest. Statistical approaches to machine translation constitute the state-of-the-art. The basic knowledge source is a parallel corpus, texts and their translations. For domains where large parallel corpora are available, such as the proceedings of the European Parliament, a high level of translation quality is reached. However, in countless other domains where large parallel corpora are not available, such as medical literature or legal decisions, translation quality is unacceptably poor.
Domain adaptation as a problem of statistical machine translation (SMT) is a relatively new research area, and there are no standard solutions. The literature contains inconsistent results and heuristics are widely used. We will solve the problem of domain adaptation for SMT on a larger scale than has been previously attempted, and base our results on standardized corpora and open source translation systems.
Our work will lead to a break-through in translation quality for the vast number of domains with less parallel text available, and have a direct impact on SMEs providing translation services. The academic impact of our work will be large because solutions to the challenge of domain adaptation apply to all natural language processing systems and in numerous other areas of artificial intelligence research based on machine learning approaches.
Funded by the European Research Council
News
- I received an ERC Proof of Concept grant "Data for Multilingual Learning (DATA4ML)"! This will create a real world demonstrator for the work we did in my ERC Starting Grant on Domain Adaptation. The main focus here is on creating a tool to help minority language activists find and create multilingual data for training machine translation systems and language models for their languages, but our work will enable the creation of domain-specific models for companies as well. Here is the press release in English and German.
- I was interviewed by the Handelsblatt newspaper about Facebook's open source AI strategy, which I strongly support. We have used Facebook models in many of our research projects, such as studies on cultural moral norms in LLMs and on transfer learning for cross-lingual hatespeech detection. Excerpts from the interview (in German).
-
Lavine and Alexandra have a paper in Findings of EMNLP on using adapters for multilingual multi-domain NMT. Viktor and Saadi have a paper at EMNLP on improving low resource languages in pretrained multilingual language models. Saadi, Viktor and Tobias Eder have a paper at the Multilingual Representation Workshop at EMNLP on comparison of cross-lingual contextual embeddings. Sophie also has a paper on some previous work on modals in Findings of EMNLP. Nice work everyone!
-
Congratulations to Lavine and Jindrich for a nice paper at Coling 2022 on domain robustness and domain adaptability.
-
Sotra, the Sorbian translator now handles Lower Sorbian as well! Dario and I were both involved in the modeling work. Here is a news video and a short press release (in Lower Sorbian and German)!
-
The ERC published a short article about my ERC Starting Grant, appropriately enough in 6 languages. See here.
- Demo at VLDB with Carsten Binnig's group on conversational agents. See Gassen et al, here.
-
Keynote at BUCC 2022, attending LREC in Marseille was fun. Here are the slides, the paper on Pseudo Parallel Corpora for OOVs, Huck et al 2019, and Hangya et al 2021, the paper on Translating Unseen and Rare Senses.
-
I gave an invited talk at the workshop AI in Science: Foundations and Applications on June 9-10, 2022 at the MCMP at LMU Munich. The topic was "Some Open Problems in (Multilingual) Natural Language Processing". Here are the slides.
-
Looking forward to hearing the official word soon about the Munich Center for Machine Learning (MCML) phase 2, in which I am a PI. This is an important focus of a wide range of efforts in machine learning in the Munich area. It is a joint effort by LMU and TUM.
-
Jindrich's paper on neural string edit distance will appear at the ACL workshop on structure prediction! See publications.
- New technical report with Kristian Kersting's group on moral norms and multilingual models. Click here for the report.
-
Katharina and Jindrich both have papers in Findings at ACL, congratulations! Katharina's (with Jindrich) is on combining the strengths of static and contextualized models, while Jindrich's (with long-time collaborator Helmut Schmid) explains why character-based MT models are not being used. The works will also be presented as posters at ACL. See publications.
- Report on arXiv by Marion and Matthias, on comparing our two approaches to dealing with target-side morphology in NMT. The two approaches are based on morphological analysis and on stemming. Experiments are on German and Czech. We hope this research will influence future work on translating into other morphologically languages. Here is the report.
- "Understanding Sorbian" on 21 Feb 2022 (International Mother Language Day!): writing about our first (unsupervised) Upper Sorbian/German system and the resonance of the shared tasks on Upper Sorbian and Lower Sorbian that we have organized at ACL-WMT. LMU Newsroom article in German, in English.
- New technical report on cross-lingual detection of hate speech with Irina, Viktor and Iryna Gurevych, click here.
- Lisa Woller and Viktor had a paper on Occitan embeddings at the 2021 Multilingual Representation Learning workshop, see the paper here, nice work!
- WMT 2021: Viktor has a paper on multi-sense words as part of our Cambridge collaboration (together with Flora Liu, Dario, Anna Korhonen), Jindrich organized a shared task and has a system paper, and Lavine has a system paper (with Jindrich), see publications, congrats!
- Irina, Alexandra, Tobias and Simon have first author papers at LT-EDI, NAACL, ACL, and MT Summit, see publications, congrats to them and to their coauthors!
- Dr. rer nat: Dario successfully defended his PhD thesis on "Modeling Contextual Information in Neural Machine Translation"! Thank you Prof. Dr. Philipp Koehn and Prof. Dr. Rico Sennrich for being part of the committee.
- We are organizing the WMT 2021 shared task on Unsupervised MT and Very Low Resource Supervised MT. The tasks are to translate Upper Sorbian and Lower Sorbian to/from German, as well as to translate Chuvash to/from Russian. Please participate! More information here.
- Dario presented a key paper for his PhD research showing that document-level context allows domain adaptation to zero-resource domains (domains which did not appear in the training data).
- New papers at EMNLP, COLING and WMT! See publications.
- Viktor is presenting new work on dealing with many-to-one and one-to-many translations (i.e., compositionality) at AMTA 2020. See publications.
- I held a talk on recent research at the Faculty of Mathematics, Informatics and Statistics (here at LMU Munich), see here for a video.
- Marion Weller-Di Marco's paper on Modeling Word Formation in English-German Neural Machine Translation appeared at ACL. See publications.
- Silvia Severini and Viktor Hangya (in joint work with Hinrich and myself) had the best results in the Russian-English shared task at BUCC 2020. See publications.
- Congratulations to Leah Michel and Viktor Hangya on a study of the challenges of using standard post-hoc-mapped Bilingual Word Embeddings for the low-resource language Hiligaynon at LREC 2020. See publications.
- We are organizing the WMT 2020 shared task on Unsupervised Machine Translation and Very Low Resource Machine Translation. The tasks are to translate from German to Upper Sorbian, and from Upper Sorbian to German. Please participate! More information here.
- Lots of new research papers to be presented in summer 2019, including curriculum training, document MT, unsupervised MT, unsupervised parallel sentence extraction, bilingual lexicon induction for MT OOVs, multilingual POS tagging. Thanks team!
- We are co-organizing the ACL 2019 Conference on Machine Translation Unsupervised Shared Task, please participate!
- We reached the halfway point of DASMT (2.5 years), see here for a summary.
- Post-doc position available, click here.
- PhD position available, German required, click here.
- New research paper at the International Workshop on Spoken Language Translation (IWSLT) from Viktor, Fabienne and Yuliya on unsupervised extraction of parallel sentences from comparable corpora. See publications.
- I gave a keynote on translation to morphologically rich languages at Baltic Human Language Technologies 2018 in Tartu at the end of September.
- New research paper at the conference on machine translation (WMT) from Dario on using oracles to explore context NMT. Dario, Matthias and Viktor also have three new WMT shared task papers (which are unsupervised, bio/news and filtering). See publications.
- Congratulations to Dr. Matthias Huck for the successful defense of his PhD thesis at RWTH Aachen!
- One post-doc and two PhD student positions are available, click here.
- Two new papers at ACL, one on domain adaptation of bilingual tasks by Viktor and Fabienne, and one from Philipp Dufter and others on massively multilingual embeddings. See publications.
- Two new papers, one from Costanza and Matthias on neural POS-tagging of lemma sequences at AMTA 2018, and one from Fabienne, Viktor, and Tobias on bilingual lexicon induction of rare words at NAACL 2018. See publications.
- New job: As of September 1st, I am Professor of Information and Language Processing (CIS, LMU Munich).
- Congratulations to Matthias Huck and Fabienne Braune for their neural machine translation system for English to German translation, which had the best results according to human evaluation at WMT! See the system paper in publications for more details.
- New Papers: Matthias Huck and Simon Riess, Aleš Tamychyna and Marion Weller-Di Marco have two papers about dealing with rich morphology on the target-side in neural machine translation (looking at linguistic segmentation and tag-lemma representation respectively), which were accepted to WMT. Valentin Deyringer has a paper on training with Hogwild! at MTM and Leonie Weißweiler has a paper on German stemming at GSCL. See publications for more details.
- The proceedings of the research track at EAMT (European Association for Machine Translation) 2017, which is in Prague, are out. There are some great research posters, here are the boaster slides (2 slides per poster).
- Congratulations to Anita Ramm and Costanza Conforti for accepted papers (two papers on verbal morphology and preordering and a paper on Venetan respectively)! See publications for more details.
- I'll be the program chair (research track) at EAMT (European Association for Machine Translation) 2017, which will be in Prague.
- Congratulations to Marion Weller-Di Marco, and to Matthias Huck and Aleš Tamchyna, for papers about translation to morphologically rich languages, which were accepted to EACL! See publications for more details.
- One post-doc and one PhD student position are available, click here (deadline has now passed)
- I was interviewed in the Süddeutsche Zeitung (newspaper), see here.
- I gave a keynote on translation to morphologically rich languages at the European Association for Machine Translation (EAMT 2016) in Riga. Here are the slides.
- The project started on December 1st, 2015, I am currently finalizing hiring post-docs.
- I've been awarded a European Research Council Starting Grant! The press release is available from the university web page: in German, English. The agreement has now been signed by the ERC. This project will have a large team with a wide breadth, requiring research expertise ranging from morphological modeling for statistical machine translation, through several forms of domain adaptation, to user interfaces. Stay tuned!
Principal Investigator
Prof. Dr. Alexander Fraser
Present Senior Staff
Dr. Jindrich Libovicky
Dr. Viktor Hangya
Dr. Dario Stojanovski
Dr. Marion Di Marco (Weller)
Past Senior Staff
Dr. Matthias Huck
Dr. Fabienne Braune
Dr. Aleš Tamchyna
Dr. Tsuyoshi Okita
Publications
-
Viktor Hangya, Hossain Shaikh Saadi, Alexander Fraser (2022). Improving Low-Resource Languages in Pre-Trained Language Models. Proceedings of EMNLP 2022, pages 11993-12006, Abu Dhabi, UAE, December. abstract
-
Wen Lai, Alexandra Chronopoulou, Alexander Fraser (2022). m^4 Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 4282-4296, Abu Dhabi, UAE, December. abstract
-
Hossain Shaikh Saadi, Viktor Hangya, Tobias Eder, Alexander Fraser (2022). Comparative Analysis of Cross-lingual Contextualized Word Embeddings. 2nd Multilingual Representation Learning Workshop at EMNLP, pages 64 to 75, Abu Dhabi, UAE, December. abstract
-
Wen Lai, Jindřich Libovický, Alexander Fraser (2022). Improving Both Domain Robustness and Domain Adaptability in Machine Translation. Proceedings of Coling, pages 5191-5204, Gyeongju, Republic of Korea, October. abstract
-
Katharina Hämmerl, Jindřich Libovický, Alexander Fraser (2022). Combining Static and Contextualised Multilingual Embeddings. In Findings of the Association for Computational Linguistics: ACL 2022, pages 2316-2329, Dublin, Ireland, May. abstract preprint link
-
Silvia Severini, Viktor Hangya, Masoud Jalili Sabet, Alexander Fraser, Hinrich Schütze (2022). Don't Forget Cheap Training Signals Before Building Unsupervised Bilingual Word Embeddings. BUCC, 15th Workshop on Building and Using Comparable Corpora, pages 15 to 22, Marseille, France, June. abstract preprint
-
Jindřich Libovický, Alexander Fraser (2022). Neural String Edit Distance. ACL 2022 6th Workshop on Structured Prediction for NLP (SPNLP), pages 52-66, Dublin, Ireland, May. abstract preprint link
-
Jindřich Libovický, Helmut Schmid, Alexander Fraser (2022). Why don't people use character-level machine translation? In Findings of the Association for Computational Linguistics: ACL 2022, pages 2470-2485, Dublin, Ireland, May. abstract preprint link
-
Marion Weller-Di Marco, Matthias Huck, Alexander Fraser (2022). Modeling Target-Side Morphology in Neural Machine Translation: A Comparison of Strategies. arXiv. abstract link
-
Denis Peskov, Viktor Hangya, Jordan Boyd-Graber, Alexander Fraser (2021). Adapting Entities across Languages and Cultures. In Findings of the Association for Computational Linguistics: EMNLP 2021, pages 3725-3750, November. abstract bib
-
Jindřich Libovický, Alexander Fraser (2021). Findings of the WMT 2021 Shared Tasks in Unsupervised MT and Very Low Resource Supervised MT. In Proceedings of the Sixth Conference on Machine Translation (WMT), pages 726-732, Punta Cana, Dominican Republic, November. abstract bib
-
Wen Lai, Jindřich Libovický, Alexander Fraser (2021). The LMU Munich System for the WMT 2021 Large-Scale Multilingual Machine Translation Shared Task. In Proceedings of the Sixth Conference on Machine Translation (WMT), pages 412-417, Punta Cana, Dominican Republic, November. abstract bib
-
Jindřich Libovický, Alexander Fraser (2021). The LMU Munich Systems for the WMT21 Unsupervised and Very Low-Resource Translation Task. In Proceedings of the Sixth Conference on Machine Translation (WMT), pages 989-994, Punta Cana, Dominican Republic, November. abstract bib
-
Viktor Hangya, Qianchu Liu, Dario Stojanovski, Alexander Fraser, Anna Korhonen (2021). Improving machine translation of rare and unseen word senses. In Proceedings of the Sixth Conference on Machine Translation (WMT), pages 614-624, Punta Cana, Dominican Republic, November. abstract bib
-
Lisa Woller, Viktor Hangya, Alexander Fraser (2021). Do not neglect related languages: The case of low-resource Occitan cross-lingual word embeddings. In Proceedings of the 1st Workshop on Multilingual Representation Learning, pages 41–50, Punta Cana, Dominican Republic, November. abstract bib
-
Simon Riess, Matthias Huck, Alexander Fraser (2021). A Comparison of Sentence-Weighting Techniques for NMT. In Proceedings of Machine Translation Summit (MT Summit), pages 176-187, August. abstract bib link video
-
Tobias Eder, Viktor Hangya, Alexander Fraser (2021). Anchor-based Bilingual Word Embeddings for Low-Resource Languages. In Proceedings of the 2021 Annual Conference of the Association for Computational Linguistics (ACL), pages 227-232, August. abstract bib link
-
Alexandra Chronopoulou, Dario Stojanovski, Alexander Fraser (2021). Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Translation. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 173-180, June. abstract bib link
-
Irina Bigoulaeva, Viktor Hangya, Alexander Fraser (2021). Cross-Lingual Transfer Learning for Hate Speech Detection. Proceedings of the First Workshop on Language Technology for Equality, Diversity and Inclusion (LT-EDI), pages 15-25, April. abstract bib link
-
Dario Stojanovski, Alexander Fraser (2021). Addressing Zero-Resource Domains Using Document-Level Context in Neural Machine Translation. Proceedings of the Second Workshop on Domain Adaptation for NLP (AdaptNLP), pages 80-93, April. abstract bib link
-
Silvia Severini, Viktor Hangya, Alexander Fraser, Hinrich Schütze (2020). Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), pages 6044-6055, December. abstract bib link
-
Dario Stojanovski, Benno Krojer, Denis Peskov and Alexander Fraser (2020). ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation. In Proceedings of the 28th International Conference on Computational Linguistics (COLING), pages 4732-4749, December. abstract bib link
-
Alexandra Chronopoulou, Dario Stojanovski, Viktor Hangya, Alexander Fraser (2020). The LMU Munich System for the WMT 2020 Unsupervised Machine Translation Shared Task. In ACL 2020 Fifth Conference on Machine Translation (WMT20), pages 1084-1091, November. abstract bib link
-
Alexander Fraser (2020). Findings of the WMT 2020 Shared Tasks in Unsupervised MT and Very Low Resource Supervised MT. In ACL 2020 Fifth Conference on Machine Translation (WMT20), pages 765-771, November. abstract bib link
-
Jindřich Libovický, Viktor Hangya, Helmut Schmid, Alexander Fraser (2020). The LMU Munich System for the WMT20 Very Low Resource Supervised MT Task. In ACL 2020 Fifth Conference on Machine Translation (WMT20), pages 1104-1111, November. abstract bib link
-
Alexandra Chronopoulou, Dario Stojanovski, Alexander Fraser (2020). Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2703-2711, November. abstract bib link
-
Jindřich Libovický and Alexander Fraser (2020). Towards Reasonably-Sized Character-Level Transformer NMT by Finetuning Subword Systems. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2572-2579, November. abstract bib link
-
Jindřich Libovický, Rudolf Rosa, Alexander Fraser (2020). On the Language Neutrality of Pre-trained Multilingual Representations. In Findings of ACL: EMNLP 2020, pages 1663-1674, November. abstract bib link
-
Yi-Chen Liu, Yuri Pettinicchi, Alexander Fraser (2020). Improving SHARE translation verification. In BigSurv20, November. abstract link
-
Dario Stojanovski, Alexander Fraser (2020). Addressing Zero-Resource Domains Using Document-Level Context in Neural Machine Translation. arXiv. abstract bib link
-
Viktor Hangya, Alexander Fraser (2020). Towards Handling Compositionality in Low-Resource Bilingual Word Induction. In Proceedings of the 14th Conference of the Association for Machine Translation in the Americas (AMTA), pages 89-101, October. abstract bib
-
Marion Weller-Di Marco, Alexander Fraser (2020). Modeling Word Formation in English-German Neural Machine Translation. In Proceedings of the 2020 Annual Conference of the Association for Computational Linguistics (ACL), pages 4227-4232, July. abstract bib
-
Silvia Severini, Viktor Hangya, Alexander Fraser, Hinrich Schütze (2020). LMU Bilingual Dictionary Induction System with Word Surface Similarity Scores for BUCC 2020. In Proceedings of the 13th Workshop on Building and Using Comparable Corpora (BUCC), pages 49-55, May. abstract bib
-
Leah Michel, Viktor Hangya, Alexander Fraser (2020). Exploring Bilingual Word Embeddings for Hiligaynon, a Low-Resource Language. In Proceedings of the 12th Edition of the Language Resources and Evaluation Conference (LREC), pages 2573-2580, May. abstract bib
-
Jindřich Libovický, Rudolf Rosa, Alexander Fraser (2019). How Language-Neutral is Multilingual BERT?. arXiv. abstract
-
Dario Stojanovski, Alexander Fraser (2019). Improving Anaphora Resolution in Neural Machine Translation Using Curriculum Learning. In Proceedings of the Machine Translation Summit 2019, pages 140-150, Dublin, Ireland, August. abstract
-
Dario Stojanovski, Viktor Hangya, Matthias Huck, Alexander Fraser (2019). The LMU Munich Unsupervised Machine Translation System for WMT19. In Proceedings of the Fourth Conference on Machine Translation, pages 592-598, Florence, Italy, August. abstract
-
Dario Stojanovski, Alexander Fraser (2019). Combining Local and Document-Level Context: The LMU Munich Neural Machine Translation System at WMT19. In Proceedings of the Fourth Conference on Machine Translation, pages 599-605, Florence, Italy, August. abstract
-
Viktor Hangya, Alexander Fraser (2019). Unsupervised Parallel Sentence Extraction with Parallel Segment Detection Helps Machine Translation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 1224–1234, Florence, Italy, July. abstract
-
Matthias Huck, Viktor Hangya, and Alexander Fraser (2019). Better OOV Translation with Bilingual Terminology Mining. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pages 5809-5815, Florence, Italy, July. abstract
-
Matthias Huck, Diana Dutka, and Alexander Fraser (2019). Cross-lingual Annotation Projection Is Effective for Neural Part-of-Speech Tagging. In Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), pages 223-233, Minneapolis, Minnesota, USA, June. abstract
-
Viktor Hangya, Fabienne Braune, Yuliya Kalasouskaya, Alexander Fraser (2018). Unsupervised Parallel Sentence Extraction from Comparable Corpora. In Proceedings of the 15th International Workshop on Spoken Language Translation (IWSLT), pages 7-13, Bruges, Belgium, October. abstract
-
Dario Stojanovski, Alexander Fraser (2018). Coreference and Coherence in Neural Machine Translation: A Study Using Oracle Experiments. In Proceedings of the Third Conference on Machine Translation, pages 49-60, Brussels, Belgium, October. abstract
-
Viktor Hangya, Alexander Fraser (2018). An Unsupervised System for Parallel Corpus Filtering. In Proceedings of the Third Conference on Machine Translation, pages 882-887, Brussels, Belgium, October. abstract
-
Matthias Huck, Dario Stojanovski, Viktor Hangya, Alexander Fraser (2018). LMU Munich's Neural Machine Translation Systems at WMT 2018. In Proceedings of the Third Conference on Machine Translation, pages 648-654, Brussels, Belgium, October. abstract
-
Dario Stojanovski, Viktor Hangya, Matthias Huck, Alexander Fraser (2018). The LMU Munich Unsupervised Machine Translation Systems. In Proceedings of the Third Conference on Machine Translation, pages 513-521, Brussels, Belgium, October. abstract
-
Viktor Hangya, Fabienne Braune, Alexander Fraser, Hinrich Schütze (2018). Two Methods for Domain Adaptation of Bilingual Tasks: Delightfully Simple and Broadly Applicable. In Proceedings of the 54th Annual Conference of the Association for Computational Linguistics (ACL), pages 810-820, Melbourne, Australia, July. abstract
-
Philipp Dufter, Mengjie Zhao, Martin Schmitt, Alexander Fraser, Hinrich Schütze (2018). Embedding Learning Through Multilingual Concept Induction. In Proceedings of the 54th Annual Conference of the Association for Computational Linguistics (ACL), pages 1520-1530, Melbourne, Australia, July. abstract
-
Fabienne Braune, Viktor Hangya, Tobias Eder, Alexander Fraser (2018). Evaluating Bilingual Word Embeddings on the Long Tail. In Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), pages 188-193, New Orleans, Louisiana, USA, June. abstract
-
Costanza Conforti, Matthias Huck, and Alexander Fraser (2018). Neural Morphological Tagging of Lemma Sequences for Machine Translation. In Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (AMTA), vol. 1: MT Research Track, pages 39-53, Boston, Massachusetts, USA, March. abstract
-
Matthias Huck, Simon Riess, Alexander Fraser (2017). Target-side Word Segmentation Strategies for Neural Machine Translation. In Proceedings of the Second Conference on Machine Translation, pages 56-67, Copenhagen, Denmark, September. abstract
-
Aleš Tamchyna, Marion Weller-Di Marco, Alexander Fraser (2017). Modeling Target-Side Inflection in Neural Machine Translation. In Proceedings of the Second Conference on Machine Translation, pages 32-42, Copenhagen, Denmark, September. abstract
-
Matthias Huck, Fabienne Braune, Alexander Fraser (2017). LMU Munich's Neural Machine Translation Systems for News Articles and Health Information Texts. In Proceedings of the Second Conference on Machine Translation, pages 315-322, Copenhagen, Denmark, September. abstract
-
Valentin Deyringer, Alexander Fraser, Helmut Schmid, Tsuyoshi Okita (2017). Parallelization of Neural Network Training for NLP with Hogwild! The Prague Bulletin of Mathematical Linguistics, No. 109, pages 29-38. abstract software
-
Leonie Weißweiler, Alexander Fraser (2017). Developing a Stemmer for German Based on a Comparative Analysis of Publicly Available Stemmers. In Proceedings of the German Society for Computational Linguistics and Language Technology (GSCL), Berlin, Germany, September. abstract software
-
Hassan Sajjad, Helmut Schmid, Alexander Fraser, Hinrich Schütze (2017). Statistical Models for Unsupervised, Semi-supervised and Supervised Transliteration Mining. Computational Linguistics. 43(2), pages 349-375. abstract preprint
-
Anita Ramm, Sharid Loáiciga, Annemarie Friedrich and Alexander Fraser (2017). Annotating Tense, Mood and Voice for English, French and German. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL), demo session, pages 1-6, Vancouver, Canada, July. abstract
-
Matthias Huck, Aleš Tamchyna, Ondřej Bojar, Alexander Fraser (2017). Producing Unseen Morphological Variants in Statistical Machine Translation. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pages 369-375, Valencia, Spain, April. abstract
-
Marion Weller-Di Marco, Alexander Fraser, Sabine Schulte im Walde (2017). Addressing Problems across Linguistic Levels in SMT: Combining Approaches to Model Morphology, Syntax and Lexical Choice. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pages 625-630, Valencia, Spain, April. abstract
-
Aleš Tamchyna, Alexander Fraser, Ondřej Bojar, Marcin Junczys-Dowmunt (2016). Target-side Context for Discriminative Models in Statistical Machine Translation. In Proceedings of the 54th Annual Conference of the Association for Computational Linguistics (ACL), pages 1704-1714, Berlin, Germany, August. abstract
-
Anita Ramm, Alexander Fraser (2016). Modeling Verbal Inflection for English to German SMT. In Proceedings of the First Conference on Machine Translation, pages 21-31, Berlin, Germany, August. abstract
-
Marion Weller-Di Marco, Alexander Fraser, Sabine Schulte im Walde (2016). Modeling Complement Types in Phrase-based SMT. In Proceedings of the First Conference on Machine Translation, pages 43-53, Berlin, Germany, August. abstract
-
Fabienne Braune, Alexander Fraser, Hal Daumé III, Aleš Tamchyna (2016). A Framework for Discriminative Rule Selection in Hierarchical Moses. In Proceedings of the First Conference on Machine Translation, pages 92-101, Berlin, Germany, August. abstract
Acknowledgments
This project has received funding from the European Research Council
(ERC) under the European Union’s Horizon 2020 research and
innovation programme (grant agreement No 640550).