<node id="671232">
  <nid>671232</nid>
  <type>event</type>
  <uid>
    <user id="28475"><![CDATA[28475]]></user>
  </uid>
  <created>1701103928</created>
  <changed>1701103950</changed>
  <title><![CDATA[Ph.D. Dissertation Defense - Milad Ghiasi Rad]]></title>
  <body><![CDATA[<p><span><span><strong><span>Title</span></strong><em><span>:&nbsp; </span></em><em><span>Improvements in the Modeling of High Dimension/Low Sample Size Imbalanced Clinical Data Sets</span></em></span></span></p>

<p><span><span><strong><span>Committee:</span></strong></span></span></p>

<p><span><span><span>Dr. </span><span>Rishikesan Kamaleswaran, BME, Chair</span><span>, Advisor</span></span></span></p>

<p><span><span><span>Dr. </span><span>Omer Inan, ECE</span><span>, Co-Advisor</span></span></span></p>

<p><span><span><span>Dr. </span><span>David Anderson, ECE</span></span></span></p>

<p><span><span><span>Dr. </span><span>Jocelyn Grunwell, Emory</span></span></span></p>

<p><span><span><span>Dr. </span><span>Soheli Saedi, Florida Tech</span></span></span></p>

<p><span><span><span>Dr. </span><span>Tony Pan, Emory</span></span></span></p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Improvements in the Modeling of High Dimension/Low Sample Size Imbalanced Clinical Data Sets ]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>This dissertation has tried to tackle three common problems in the field of biomedical research data analytic. The problems that are covered in this document fall around the concept of imbalance and high dimensionality that is in the nature of the datasets that are gathered in this field of research, although they can be extended to other fields as well. Both imbalance and high dimensionality suffer this area of research by impacting the predictive models negatively resulting in introduction of over-fit, or noise in the models. First high dimensionality of the whole blood gene expression arrays are addressed and a new approach using Stability Selection has been proposed to reduce the dimension of these datasets. Then a novel pipeline to combine single-cohort studies into multi-cohort studies is proposed. The pipeline is tested on two GSE datasets which verified the proposed approach. Then, Stability Selection was used to be combined with SMOGN to boost the performance of regression in imbalanced, small, and horizontal datasets. The increase in the regression accuracy enabled further discoveries on AirPICU dataset which and showed the importance of PRISM as a very effective predictor of Ventilation Free Days which indirectly indicates the mortality chance. Finally, the use of unsupervised generative models like CTGAN and TVAE was investigated to reduce the imbalance in imbalanced datasets. It was observed that CTGAN is a powerful model that can improve the performance of SMOTE in imbalance removal both with over-sampling and complete synthetic data modeling.</p>
]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2023-12-04T13:00:00-05:00]]></value>
      <value2><![CDATA[2023-12-04T15:00:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Online]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
          <item>
        <url>https://teams.microsoft.com/l/meetup-join/19%3ameeting_OTMxOWE2NjMtMTQxNi00MmFlLTg1OWYtYzg3YTMyYjJiZWYx%40thread.v2/0?context=%7b%22Tid%22%3a%22e004fb9c-b0a4-424f-bcd0-322606d5df38%22%2c%22Oid%22%3a%220cd997d2-7ff5-4017-9df0-abf7dc8e4342%22%7d</url>
        <link_title><![CDATA[Microsoft Teams Meeting link]]></link_title>
      </item>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>434381</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[ECE Ph.D. Dissertation Defenses]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
          <item>
        <tid>1808</tid>
        <value><![CDATA[graduate students]]></value>
      </item>
      </field_keywords>
  <userdata><![CDATA[]]></userdata>
</node>
