<node id="635860">
  <nid>635860</nid>
  <type>event</type>
  <uid>
    <user id="27544"><![CDATA[27544]]></user>
  </uid>
  <created>1591027640</created>
  <changed>1591027763</changed>
  <title><![CDATA[ARC and Indo-US Virtual Center Seminar: Pravesh Kothari (CMU)]]></title>
  <body><![CDATA[<p align = "center"><strong>Algorithms &amp; Randomness Center (ARC) and Indo-US Virtual Center Seminar</strong></p>

<p align = "center"><strong>Pravesh Kothari (CMU)</strong></p>

<p align = "center"><strong>Monday, June 8, 2020</strong></p>

<p align = "center"><strong>Virtual via Bluejeans - 11:30 am</strong></p>

<p align = "center">&nbsp;</p>

<p><strong>Title:&nbsp; </strong>Outlier-robust Clustering of Gaussian Mixtures</p>

<p><strong>Abstract:&nbsp; </strong>We give efficient algorithms for robustly clustering of mixtures of &quot;reasonable&quot; distributions, including the well-known open problem of robustly clustering a mixture of arbitrary Gaussians. Specifically, we&nbsp;give an outlier-robust efficient algorithm for clustering a mixture of k Gaussians with pairwise TV distance 1-exp(k^k/\eta). The running time of our algorithm is d^{(k/\eta)^{O(k)}}. More generally, our algorithm succeeds for mixtures of distributions that satisfy two well-studied analytic assumptions - certifiable hypercontractivity and anti-concentration. Thus, it extends to clustering mixtures of arbitrary affine transforms of the uniform distribution on the d-dimensional unit sphere. Even the information-theoretic clusterability of distributions satisfying our analytic assumptions was not known and is likely to be of independent interest. Our techniques expand the sum-of-squares toolkit to show robust certifiability of TV-separated Gaussian clusters in data. This involves a low-degree sum-of-squares proof of statements that relate parameter distance to total variation distance simply relying on hypercontractivity and anti-concentration.</p>

<p>It remains open to improve the running time of the algorithms and to give a robust parameter estimation algorithm for Gaussian mixtures with no separation assumptions.&nbsp;</p>

<p>Based on joint work with Ainesh Bakshi (CMU).&nbsp;</p>

<p>----------------------------------</p>

<p><a href="https://www.cs.princeton.edu/~kothari/">Speaker&#39;s Webpage</a></p>

<p><em>Videos of recent talks are available at: </em><a href="https://smartech.gatech.edu/handle/1853/46836"><em>https://smartech.gatech.edu/handle/1853/46836</em></a></p>

<p><a href="https://mailman.cc.gatech.edu/mailman/listinfo/arc-colloq"><em>Click here to subscribe to the seminar email list: arc-colloq@Klauscc.gatech.edu </em></a></p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Outlier-robust Clustering of Gaussian Mixtures - Virtual via Bluejeans at 11:30am]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2020-06-08T12:30:00-04:00]]></value>
      <value2><![CDATA[2020-06-08T13:30:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Postdoc]]></value>
      </item>
          <item>
        <value><![CDATA[Graduate students]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>70263</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[ARC]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1795</tid>
        <value><![CDATA[Seminar/Lecture/Colloquium]]></value>
      </item>
      </field_categories>
  <field_keywords>
      </field_keywords>
  <userdata><![CDATA[]]></userdata>
</node>
