<node id="618856">
  <nid>618856</nid>
  <type>event</type>
  <uid>
    <user id="34977"><![CDATA[34977]]></user>
  </uid>
  <created>1551820574</created>
  <changed>1551880764</changed>
  <title><![CDATA[ISyE Statistic Seminar - Chao Zhang]]></title>
  <body><![CDATA[<p><strong>Title:</strong></p>

<p>Multidimensional Text Mining with Limited Supervision</p>

<p><strong>Abstract:</strong></p>

<p>Unstructured text, as one of the most important data forms, plays a crucial role in domains such as cybersecurity, healthcare informatics, and&nbsp;cyber-physical systems. In many emerging applications, people&#39;s information&nbsp;need from text data is becoming multidimensional---they demand useful insights&nbsp;along multiple aspects from the given text corpus. However, acquiring&nbsp;multidimensional knowledge from massive text data challenges existing data&nbsp;mining techniques. In this talk, I will present a structuring-and-mining&nbsp;framework for facilitating acquiring multidimensional knowledge from text data.&nbsp;It organizes unstructured text into a multidimensional and multi-granular&nbsp;structure, from which end users can easily select relevant data with&nbsp;declarative queries and apply any data mining primitives thereafter. I will&nbsp;detail two core algorithms in this framework, including (1) a weakly supervised&nbsp;text classification algorithm; and (2) an abnormal event detection algorithm.&nbsp;The algorithms in the framework all require little supervision and are thus&nbsp;particularly appealing in scenarios where labeled data are expensive to&nbsp;acquire.</p>

<p><strong>Bio:</strong></p>

<p>Chao Zhang is an Assistant Professor at College of Computing, Georgia Institute&nbsp;of Technology. His research area is data mining and machine learning. He is&nbsp;particularly interested in developing label-efficient and robust learning&nbsp;techniques, with applications in text mining and spatiotemporal data mining.&nbsp;Chao has published more than 40 papers in top-tier conferences and journals,&nbsp;such as KDD, WWW, SIGIR, VLDB, and TKDE.&nbsp; He is the recipient of the ECML/PKDD&nbsp;Best Student Paper Runner-up Award (2015) and the Chiang Chen Overseas Graduate&nbsp;Fellowship (2013). Before joining Georgia Tech, he obtained his Ph.D. degree in&nbsp;Computer Science from University of Illinois at Urbana-Champaign in 2018.</p>

<p>&nbsp;</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Multidimensional Text Mining with Limited Supervision]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p><strong>Abstract:</strong></p>

<p>Unstructured text, as one of the most important data forms, plays a crucial role in domains such as cybersecurity, healthcare informatics, and&nbsp;cyber-physical systems. In many emerging applications, people&#39;s information&nbsp;need from text data is becoming multidimensional---they demand useful insights&nbsp;along multiple aspects from the given text corpus. However, acquiring&nbsp;multidimensional knowledge from massive text data challenges existing data&nbsp;mining techniques. In this talk, I will present a structuring-and-mining&nbsp;framework for facilitating acquiring multidimensional knowledge from text data.&nbsp;It organizes unstructured text into a multidimensional and multi-granular&nbsp;structure, from which end users can easily select relevant data with&nbsp;declarative queries and apply any data mining primitives thereafter. I will&nbsp;detail two core algorithms in this framework, including (1) a weakly supervised&nbsp;text classification algorithm; and (2) an abnormal event detection algorithm.&nbsp;The algorithms in the framework all require little supervision and are thus&nbsp;particularly appealing in scenarios where labeled data are expensive to&nbsp;acquire.</p>
]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2019-03-11T15:00:00-04:00]]></value>
      <value2><![CDATA[2019-03-11T16:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Postdoc]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[https://www.isye.gatech.edu/about/maps-directions/isye-building-complex]]></url>
      <title><![CDATA[ISyE Building ]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>1242</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[School of Industrial and Systems Engineering (ISYE)]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1795</tid>
        <value><![CDATA[Seminar/Lecture/Colloquium]]></value>
      </item>
      </field_categories>
  <field_keywords>
      </field_keywords>
  <userdata><![CDATA[]]></userdata>
</node>
