<node id="618789">
  <nid>618789</nid>
  <type>event</type>
  <uid>
    <user id="27544"><![CDATA[27544]]></user>
  </uid>
  <created>1551796425</created>
  <changed>1552998905</changed>
  <title><![CDATA[ARC Colloquium: Ravi Kannan (MSR)]]></title>
  <body><![CDATA[<p align = "center"><strong>Algorithms &amp; Randomness Center (ARC)</strong></p>

<p align = "center"><strong>Ravi Kannan (MSR)</strong></p>

<p align = "center"><strong>Monday, March 25, 2019</strong></p>

<p align = "center"><strong>Klaus 1116E - 11:00 am</strong></p>

<p>&nbsp;</p>

<p><strong>Title:&nbsp; </strong>A General Algorithm for Unsupervised Learning problems</p>

<p><strong>Abstract:&nbsp; </strong>The following simply-stated geometric problem includes as special cases the core problems of a&nbsp; number&nbsp; of&nbsp; areas&nbsp; in&nbsp; Unsupervised&nbsp; Learning,&nbsp; including,&nbsp; Topic&nbsp; Modeling,&nbsp; Non-negative&nbsp; Matrix Factorization, Clustering, Stochastic Block Models and Overalapping Communities Detection:</p>

<p>There is an unknown polytope <em>K</em>&nbsp; &isin; <strong>R</strong><em><sup>d</sup></em> with<em> k</em> vertices.&nbsp; We are given n data points, each aperturbation of some point in <em>K.</em>&nbsp; The problem is to find <em>K</em>, i.e., its vertices (approximately).&nbsp; [The perturbations are large; indeed, many data points lie outside<em> K</em>.]</p>

<p>Our main result is an algorithm which solves this general problem under two natural assumptions.&nbsp; Our assumptions are technically different,&nbsp; but similar in spirit to existing models for the special cases.&nbsp; We assume separation between the vertices of<em> K</em>&nbsp; and the existence of &ldquo;pure&rdquo; data points whose unperturbed versions are close to the vertices of <em>K</em>.&nbsp; Notably we do not assume any stochastic&nbsp; model&nbsp; of&nbsp; data.&nbsp;&nbsp; Our&nbsp; algorithm&nbsp; has&nbsp; better&nbsp; complexity&nbsp; than&nbsp; known&nbsp; algorithms&nbsp; for&nbsp; the special cases when the input matrix<strong> A</strong> is sparse and k is relatively small compared to <em>n, d</em>.</p>

<p>The algorithm is simply stated, but the proof of correctness is involved.&nbsp; It draws on tools in Numerical Analysis, especially perturbation of singular spaces of matrices.&nbsp; Here is a description of our algorithm:&nbsp; It has <em>k</em> stages; in each stage, it picks a certain random vector <em>u</em>, finds the <em>m</em> largest<em> u &middot; x </em>over data points<em> x</em> and outputs the average of these data points as an approximation to a new vertex of <em>K</em>.</p>

<p>Joint Work with C. Bhattacharyya</p>

<p>&nbsp;</p>

<p>----------------------------------</p>

<p><a href="https://simons.berkeley.edu/people/ravi-kannan">Speaker&#39;s Webpage</a></p>

<p><em>Videos of recent talks are available at: </em><a href="https://smartech.gatech.edu/handle/1853/46836"><em>https://smartech.gatech.edu/handle/1853/46836</em></a></p>

<p><a href="https://mailman.cc.gatech.edu/mailman/listinfo/arc-colloq"><em>Click here to subscribe to the seminar email list: arc-colloq@cc.gatech.edu </em></a></p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[A General Algorithm for Unsupervised Learning problems - Klaus 1116 East at 11 am]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2019-03-25T12:00:00-04:00]]></value>
      <value2><![CDATA[2019-03-25T13:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Postdoc]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>70263</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[ARC]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1789</tid>
        <value><![CDATA[Conference/Symposium]]></value>
      </item>
      </field_categories>
  <field_keywords>
      </field_keywords>
  <userdata><![CDATA[]]></userdata>
</node>
