{"283621":{"#nid":"283621","#data":{"type":"event","title":"SCS Talk: Ymir Vigfusson, Assistant Professor, Reykjavik University","body":[{"value":"\u003Cp\u003E\u003Cstrong\u003ESPEAKER:\u0026nbsp;\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EYmir Vigfusson, Assistant Professor, Reykjavik University\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003ETITLE:\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EDynamically Profiling the Distributed Cache Tier in Data Centers\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EABSTRACT:\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EScalable data replication protocols and layers, such as streaming, multicast and caching, enable many large distributed systems to be practical. As a concrete example, large-scale in-memory object caches like memcached are now widely used to accelerate popular web sites and to reduce burden on backend databases. Yet operators still have limited visibility into how these caches should be set up to optimally accommodate the workloads they see. How much would the cache performance improve from additional cache space, or by adding more cache servers to the pool? Since resources come at a cost, to what extent would request latencies deteriorate if cache memory were repurposed for a different service?\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003EIn this talk, I\u0027ll focus on some of the latest research questions pertaining to scalable data replication and large-scale distributed caches. In particular, I\u0027ll home in on the challenge of providing online monitoring of the cost and benefits of memory space in a large-scale cache, enabling cache operators to answer the questions above without requiring extraneous trace collection and manual offline tuning. I will introduce general and efficient algorithms for dynamically estimating hit rate curves -- histograms of cache hit rate as a function of memory size -- which can be plugged into cache replacement policies such as LRU.\u0026nbsp;\u003C\/p\u003E\u003Cp\u003EExtensive simulations on cache benchmarks indicate that these methods provide accurate estimates of hit rate at different cache sizes. Experiments on an implementation of these methods in memcached showed that hit rate curves were dynamically estimated at over 98% accuracy with only a small drop in throughput. The results are encouraging and suggest that exposing hit rate curves can be a practical method for improving provisioning and metering of large-scale caches.\u003C\/p\u003E\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EBIO:\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EProf.\u003Cstrong\u003E\u0026nbsp;\u003C\/strong\u003EYmir\u003Cstrong\u003E\u0026nbsp;\u003C\/strong\u003EVigfusson\u0026nbsp;has been an Assistant Professor in the School of Computer Science at Reykjavik University since 2011. Ymir received his B.Sc. in Mathematics from the University of Iceland (2005) and his Ph.D. in Computer Science from\u0026nbsp;Cornell\u0026nbsp;University (2009), where he researched approaches to exploit group similarity and improve scalability in distributed systems. His dissertation was nominated for the ACM\u0026nbsp;Doctoral\u0026nbsp;Dissertation Award by\u0026nbsp;Cornell. Ymir was a post-doctoral\u0026nbsp;scientist at\u0026nbsp;IBM\u0026nbsp;Research (2009-2011) where he worked on caching services in the cloud and the security of financial institutions. Ymir\u0027s research projects include creating and optimizing systems and algorithms for distributed settings, and getting data replication to work in a variety of environments. He founded Syndis in 2013, a growing start-up focused on offensive security and attack services for high-value clients. Ymir\u0027s work has been partially supported by a Fulbright Scholarship, a Yahoo! Research grant and a Grant-of-Excellence from the Icelandic Research Centre.\u003C\/p\u003E","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"Dynamically Profiling the Distributed Cache Tier in Data Centers"}],"uid":"27734","created_gmt":"2014-03-14 13:49:13","changed_gmt":"2016-10-08 02:07:14","author":"Antonette Benford","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2014-03-27T16:00:00-04:00","event_time_end":"2014-03-27T17:00:00-04:00","event_time_end_last":"2014-03-27T17:00:00-04:00","gmt_time_start":"2014-03-27 20:00:00","gmt_time_end":"2014-03-27 21:00:00","gmt_time_end_last":"2014-03-27 21:00:00","rrule":null,"timezone":"America\/New_York"},"extras":["free_food"],"hg_media":{"283521":{"id":"283521","type":"image","title":"Ymir Vigfusson- Asst. Professor, Reykjavik University","body":null,"created":"1449244199","gmt_created":"2015-12-04 15:49:59","changed":"1475894978","gmt_changed":"2016-10-08 02:49:38","alt":"Ymir Vigfusson- Asst. Professor, Reykjavik University","file":{"fid":"198985","name":"ymir_vigfusson.jpg","image_path":"\/sites\/default\/files\/images\/ymir_vigfusson_0.jpg","image_full_path":"http:\/\/www.tlwarc.hg.gatech.edu\/\/sites\/default\/files\/images\/ymir_vigfusson_0.jpg","mime":"image\/jpeg","size":36881,"path_740":"http:\/\/www.tlwarc.hg.gatech.edu\/sites\/default\/files\/styles\/740xx_scale\/public\/images\/ymir_vigfusson_0.jpg?itok=FlMUQpLE"}}},"media_ids":["283521"],"groups":[{"id":"47223","name":"College of Computing"}],"categories":[],"keywords":[],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1795","name":"Seminar\/Lecture\/Colloquium"}],"invited_audience":[{"id":"78771","name":"Public"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[{"value":"\u003Cp\u003EKarsten Schwan @ (404) 894-2589 or \u003Ca href=\u0022mailto:milos@cc.gatech.edu\u0022\u003Ekarsten.schwan@cc.gatech.edu\u003C\/a\u003E\u003C\/p\u003E","format":"limited_html"}],"email":[],"slides":[],"orientation":[],"userdata":""}}}