<node id="663534">
  <nid>663534</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1669826026</created>
  <changed>1669826026</changed>
  <title><![CDATA[MS Proposal by Joshua Kuperman]]></title>
  <body><![CDATA[<p>Joshua Kuperman<br />
[Advisor: Dr. Evangelos Theodorou]<br />
will propose a master&rsquo;s thesis entitled,<br />
Integrating Perception into Safe Differentiable Control<br />
On<br />
Tuesday, December 13 at 10:00 a.m.<br />
Weber SS&amp;T 200<br />
Abstract<br />
A great challenge exists at the intersection of perception and controls &ndash; integrating the uncertainty<br />
present in perception-based state and obstacle estimation into safe control and trajectory optimization.<br />
This thesis proposes a model-based learning framework with a policy defined by a safe differentiable<br />
optimal controller. We will leverage many of the ideas of the world model, an unsupervised<br />
reinforcement learning technique that has achieved human-level or better-than-human performance on<br />
many Atari games. Specifically, we intend on training a variational autoencoder, a common<br />
unsupervised image processing technique, to learn a latent space representation that is decoded into a<br />
form that a safety function can be defined on, such as a depth map or occupancy grid. We will learn the<br />
dynamics of this latent space, as well as a mapping from the latent space directly to a safety function, to<br />
provide a differentiable controller with information on how the agent and the environment changes<br />
over time as a function of the control actions. The controller will have the safety function embedded<br />
into the dynamics using barrier states. The barrier state (BaS), and its discrete counterpart (DBaS), is a<br />
recently developed method of embedding the safety of a system into the dynamics, providing greater<br />
safety information than penalty methods, a regularizing effect, and safety guarantees to complex<br />
dynamical systems in environments with many obstacles. Tolerant discrete barrier states (TDBaS)<br />
approximate the safety guarantees of DBaS while improving exploration, allowing for unsafe initial state<br />
trajectories, and providing several parameters that can be intuitively tuned for any application. This<br />
thesis explores how differentiable trajectory optimization can learn these TDBaS safety parameters<br />
given safety uncertainty in a reinforcement learning setting with limited supervision. Towards this end,<br />
we will explore a variety of strategies and structures for the encoder-decoder network, the dynamics<br />
network, the safety function network, and the differentiable controller such as Parametric Differentiable<br />
Dynamic Programming (PDDP), Pontryagin Differentiable Programming (PDP), Barrier Nets, and<br />
Differentiable MPC. We will test this framework in simulation, and if time allows, on hardware in the<br />
Indoor Flight Laboratory or Robotarium.<br />
Committee<br />
&bull; Prof. Evangelos Theodorou &ndash; School of Aerospace Engineering (advisor)<br />
&bull; Prof. Kyriakos G. Vamvoudakis &ndash; School of Aerospace Engineering<br />
&bull; Prof. Patricio Vela &ndash; School of Electrical and Computer Engineering</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Integrating Perception into Safe Differentiable Control]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2022-12-13T10:00:00-05:00]]></value>
      <value2><![CDATA[2022-12-13T12:00:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>166866</tid>
        <value><![CDATA[MS Proposal]]></value>
      </item>
      </field_keywords>
  <userdata><![CDATA[]]></userdata>
</node>
