I am an assistant professor at the Department of Computer Science, Courant Institute of New York University and a research fellow at the CILVR Lab. My research interests include studying generalization in artificial intelligence models, particularly sequence modeling architectures, through self-supervised and transfer/few-shot learning methods. I also actively work on developing resources and methodology for low-resource natural language processing.
The main goal of my research is extending the applicability of language technologies in more languages and tasks. Language technology is a highly promising tool with the potential of transforming how we use and benefit from education and the media, although state-of-the-art approaches are still not competitive enough to be deployed in a large portion of the world languages. This problem is related to the nature of assumptions made in formulating the statistical language models that usually fail to generalize to languages with various syntactic typology, in particular the ones with relatively high data sparsity. I find this highly constrained design problem quite intriguing due to its potential in benefiting a tremendous amount of future applications; at the same time, in line with my background in engineering, where I had specialised on developing optimized software solutions for real-time computation-intensive multimedia technology. Previous to joining the New York University, I was a post-doctoral researcher and lecturer at the University of Zürich, and an applied research scientist intern at the Amazon Alexa research, where I worked on developing novel methods for improving generative models in low-resource and morphologically-rich languages.
Ph.D. in Information Engineering and Computer Science, 2019
University of Trento
Ph.D. in Informatics (Visiting Post-graduate Student), 2018
University of Edinburgh
M.Sc. in Embedded Systems and Multimedia Technology, 2015
University of Leuven
B.Sc. in Electrical and Electronics Engineering, 2013
Middle East Technical University
I will be giving an invited talk at WAT2022, the 9th Workshop on Asian Translation collocated with COLING, on October 17th 2022.
ACL has confirmed the establishment of the new Special Interest Group on Turkic Languages (SIGTURK), which will aim to promote progress in the field of computational linguistics in Turkic Languages. I will serve as the Chair while Sardana Ivanova will serve as the Secretary.
Our paper “Quantifying Synthesis and Fusion and their Impact on Machine Translation” has been accepted to appear at NAACL 2022. The paper presents novel insights on the morphosyntactic characteristics of generated words by machine translation systems in languages with different morphological typology and was written in collaboration with the University of Edinburgh, Aalborg University and New York University.
I will be serving as an Area Chair in the area of “Low-resourced and less-studied languages” at COLING 2022.
The Workshop on Multilingual Representation Learning (MRL) was organized for the first time in conjunction with EMNLP 2021 on November 11, 2021. Here are some highlights from the research findings presented at the workshop.