Dr. Nathan M. White

Dr. Nathan M. White has worked with the Hmong diaspora community for nearly a decade, focusing on applied, natural language processing-based methods for the linguistic analysis of Hmong, an under-documented, low-resource language with origins in Southeast Asia. As part of his PhD, he produced the Hmong Medical Corpus, a gold-standard biomedical corpus with an integrated artificial intelligence-based question-answering system, which represents the first such system for a low-resource minority language.

Currently, Dr. White is a postdoctoral research associate. He continues his research in machine learning and in fieldwork-based data acquisition and analysis in the Hmong community, with a focus on developing novel computational methods for analyzing linguistic data, as well as AI-based NLP applications for low-resource environments.

His interests include computational linguistics, natural language processing, machine learning, language documentation and typology, big data analytics and corpus linguistics, multilingualism and language acquisition, and the development of AI for low-resource and minority languages. His ongoing academic research focuses on Hmong and other Hmongic languages of Southeast Asia, as well as the Yokuts languages of central California, with forays into financial NLP.