{"392451":{"#nid":"392451","#data":{"type":"event","title":"Ph.D Proposal Defense by Yi Yang","body":[{"value":"\u003Cp\u003EPh.D. Thesis Proposal Announcement\u003Cbr \/\u003E\u003Cbr \/\u003E\u003Cstrong\u003ETitle: Robust Adaptation of Natural Language Processing for Language Variation\u003C\/strong\u003E\u003Cbr \/\u003E\u003Cbr \/\u003E\u003Cstrong\u003EYi Yang\u003C\/strong\u003E\u003Cbr \/\u003EPh.D. Student\u003Cbr \/\u003ESchool of Interactive Computing\u003Cbr \/\u003ECollege of Computing\u003Cbr \/\u003EGeorgia Institute of Technology\u003Cbr \/\u003E\u003Ca href=\u0022http:\/\/www.cc.gatech.edu\/~yyang319\/\u0022 target=\u0022_blank\u0022\u003Ehttp:\/\/www.cc.gatech.edu\/~yyang319\/\u003C\/a\u003E\u003Cbr \/\u003E\u003Cbr \/\u003EDate: Tuesday, April 7, 2015\u003Cbr \/\u003ETime: 3:00pm \u2013 5:00pm EDT\u003Cbr \/\u003ELocation: Klaus 1212\u003Cbr \/\u003E\u003Cbr \/\u003E\u003Cstrong\u003ECommittee\u003C\/strong\u003E\u003Cbr \/\u003EDr. Jacob Eisenstein (Advisor), School of Interactive Computing, Georgia Institute of Technology\u003Cbr \/\u003EDr. James M. Rehg, School of Interactive Computing, Georgia Institute of Technology\u003Cbr \/\u003EDr. Duen Horng (Polo) Chau, School of Computational Science \u0026amp; Engineering, Georgia Institute of Technology\u003Cbr \/\u003EDr. Byron Boots, School of Interactive Computing, Georgia Institute of Technology\u003Cbr \/\u003E\u003Cbr \/\u003E\u003Cbr \/\u003E\u003Cstrong\u003EAbstract:\u003C\/strong\u003E\u003Cbr \/\u003ENatural Language Processing (NLP) technology has been applied in various domains, ranging from social media and digital humanities to public health. Unfortunately, the adoption of existing NLP techniques in these areas often experiences unsatisfactory performances, as existing NLP techniques are driven by standard corpora, which is vulnerable to variation in languages of new datasets and settings. Previous approaches toward this problem suffer from two major weaknesses. First, they usually employ supervised methods that require expensive annotations and easily become outdated with respect to the dynamic nature of languages. Second, they often fail to leverage the valuable metadata associated with the target languages of these areas. \u003Cbr \/\u003E\u003Cbr \/\u003EIn this thesis, I propose to overcome these weaknesses by exploring unsupervised learning techniques to build NLP systems that are robust to language variation, primarily branching into: a) unsupervised text normalization, transforming lexical variations into text that better matches standard datasets; b) unsupervised domain adaptation, adapting standard NLP tools to fit the text with variation directly, through learning of representations that are robust to variation; c) personalized natural language processing, incorporating user metadata to adapt generic NLP to each individual user. These approaches are driven by co-occurrence statistics as well as rich metadata without the need of costly annotations, and can easily adapt to new settings. My preliminary work on text normalization and domain adaptation delivers state-of-the-art NLP systems for social media and historical text. As a future work, I propose to further boost the results by leveraging various user metadata.\u003Cbr \/\u003E\u003Cbr \/\u003E\u003C\/p\u003E","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"Robust Adaptation of Natural Language Processing for Language Variation"}],"uid":"27707","created_gmt":"2015-04-01 09:08:43","changed_gmt":"2016-10-08 01:45:59","author":"Tatianna Richardson","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2015-04-07T16:00:00-04:00","event_time_end":"2015-04-07T18:00:00-04:00","event_time_end_last":"2015-04-07T18:00:00-04:00","gmt_time_start":"2015-04-07 20:00:00","gmt_time_end":"2015-04-07 22:00:00","gmt_time_end_last":"2015-04-07 22:00:00","rrule":null,"timezone":"America\/New_York"},"extras":[],"groups":[{"id":"221981","name":"Graduate Studies"}],"categories":[],"keywords":[{"id":"1366","name":"defense"},{"id":"1808","name":"graduate students"},{"id":"121281","name":"Phd."},{"id":"3395","name":"proposal"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1788","name":"Other\/Miscellaneous"}],"invited_audience":[{"id":"78771","name":"Public"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":[],"slides":[],"orientation":[],"userdata":""}}}