{"627561":{"#nid":"627561","#data":{"type":"event","title":"ML@GT Seminar: David Bamman, University of California, Berkeley","body":[{"value":"\u003Cp\u003E\u003Ca href=\u0022http:\/\/ml.gatech.edu\/\u0022 target=\u0022_blank\u0022\u003EThe Machine Learning Center\u003C\/a\u003E\u0026nbsp;invites you to a lecture by\u0026nbsp;\u003Cstrong\u003EDavid Bamman\u003C\/strong\u003E, an assistant professor in the School of Information at UC Berkeley.\u0026nbsp;\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\nThe lecture will be held at 12:45 p.m.\u0026nbsp;on Friday, November 15\u0026nbsp;in Klaus 2443.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Ca href=\u0022https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSfm-v7A0lngKJznZLjzdcmC9Y9coGoXm7sNyNl6DT9mbIKqxg\/viewform?usp=sf_link\u0022 target=\u0022_blank\u0022 title=\u0022https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSfm-v7A0lngKJznZLjzdcmC9Y9coGoXm7sNyNl6DT9mbIKqxg\/viewform?usp=sf_link\u0022\u003ERSVP Here\u003C\/a\u003E\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\n\u003Cstrong\u003ETitle:\u0026nbsp;\u003C\/strong\u003EThe Data-Driven Analysis of Literature\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\n\u003Cstrong\u003EAbstract:\u003C\/strong\u003E\u0026nbsp;Literary novels push the limits of natural language processing. While much work in NLP has been heavily optimized toward the narrow domains of news and Wikipedia, literary novels are an entirely different animal--the long, complex sentences in novels strain the limits of syntactic parsers with super-linear computational complexity, their use of figurative language challenges representations of meaning based on neo-Davidsonian semantics, and their long length (ca. 100,000 words on average) rules out existing solutions for problems like coreference resolution that expect a small set of candidate antecedents.\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\nAt the same time, fiction drives computational research questions that are uniquely interesting to that domain. In this talk, I\u0026#39;ll outline some of the opportunities that NLP presents for research in the quantitative analysis of culture--including measuring the disparity in attention given to characters as a function of their gender over two hundred years of literary history (Underwood et al. 2018)--and describe our progress to date on two problems essential to a more complex representation of plot: recognizing the entities in literary texts, such as the characters, locations, and spaces of interest (Bamman et al. 2019) and identifying the events that are depicted as having transpired (Sims et al. 2019).\u0026nbsp;\u0026nbsp;Both efforts involve the creation of a new dataset of 200,000 words evenly drawn from 100 different English-language literary texts and building computational models to automatically identify each phenomenon.\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\nThis is joint work with Matt Sims, Ted Underwood, Sabrina Lee, Jerry Park, Sejal Popat and Sheng Shen\u003Cbr \/\u003E\r\n\u003Cbr \/\u003E\r\n\u003Cstrong\u003EBio:\u0026nbsp;\u003C\/strong\u003EDavid Bamman is an assistant professor in the School of Information at UC Berkeley, where he works on applying natural language processing and machine learning to empirical questions in the humanities and social sciences. His research often involves adding linguistic structure (e.g., syntax, semantics, coreference) to statistical models of text, and focuses on improving NLP for a variety of languages and domains (such as literary text and social media).\u0026nbsp;\u0026nbsp;Before Berkeley, he received his PhD in the School of Computer Science at Carnegie Mellon University (LTI).\u003C\/p\u003E\r\n","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"The Machine Learning Center at Georgia Tech\u00a0invites you to a seminar by David Bamman, an assistant professor from the University of California, Berkeley."}],"uid":"34773","created_gmt":"2019-10-14 17:06:48","changed_gmt":"2019-10-29 16:53:22","author":"ablinder6","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2019-11-15T12:45:00-05:00","event_time_end":"2019-11-15T13:45:00-05:00","event_time_end_last":"2019-11-15T13:45:00-05:00","gmt_time_start":"2019-11-15 17:45:00","gmt_time_end":"2019-11-15 18:45:00","gmt_time_end_last":"2019-11-15 18:45:00","rrule":null,"timezone":"America\/New_York"},"extras":[],"hg_media":{"627562":{"id":"627562","type":"image","title":"David Bamman","body":null,"created":"1571072894","gmt_created":"2019-10-14 17:08:14","changed":"1571072894","gmt_changed":"2019-10-14 17:08:14","alt":"","file":{"fid":"238937","name":"bamman_pic.jpg","image_path":"\/sites\/default\/files\/images\/bamman_pic.jpg","image_full_path":"http:\/\/www.tlwarc.hg.gatech.edu\/\/sites\/default\/files\/images\/bamman_pic.jpg","mime":"image\/jpeg","size":110740,"path_740":"http:\/\/www.tlwarc.hg.gatech.edu\/sites\/default\/files\/styles\/740xx_scale\/public\/images\/bamman_pic.jpg?itok=-aEqg9-G"}}},"media_ids":["627562"],"groups":[{"id":"47223","name":"College of Computing"},{"id":"1299","name":"GVU Center"},{"id":"576481","name":"ML@GT"}],"categories":[],"keywords":[],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1795","name":"Seminar\/Lecture\/Colloquium"}],"invited_audience":[{"id":"78761","name":"Faculty\/Staff"},{"id":"177814","name":"Postdoc"},{"id":"78771","name":"Public"},{"id":"174045","name":"Graduate students"},{"id":"78751","name":"Undergraduate students"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[{"value":"\u003Cp\u003EKyla Hanson\u003C\/p\u003E\r\n\r\n\u003Cp\u003Ekhanson@cc.gatech.edu\u003C\/p\u003E\r\n","format":"limited_html"}],"email":[],"slides":[],"orientation":[],"userdata":""}}}