Current Work
I am currently developing Themis, the successor to
TritonSort.
Themis is a framework for building efficient and balanced distributed
applications. We have a MapReduce implementation built on top of Themis, which
we are actively working to improve in terms of balance, speed, and robustness.
Past Work
I developed the world's fastest sorting system,
TritonSort. TritonSort
achieves record speeds by focusing on per-disk and per-node efficiency.
TritonSort's goal is to sort data at the speed of the disks by keeping all disks
constantly reading or writing data in large contiguous chunks. TritonSort set
world records in the 2010 and 2011
Sortbenchmark.org competitions. We hold
a total of seven world records: 2010 Indy GraySort, 2010 Indy MinuteSort, 2011
Daytona GraySort, 2011 Indy GraySort, 2011 Indy MinuteSort, 2011 Daytona 100TB
JouleSort, 2011 Indy 100TB JouleSort. Five of these are current world records.
TritonSort was able compete in the Daytona category after we implemented
MapReduce on top of its core components. Future work consists of verifying
efficiency and balance in general MapReduce applications.
I held a Software Engineering Intern position at Google during the summer of
2011 working in the MapReduce group with my mentor
Marian Dvorsky.
I held a Software Engineering Intern position at Google
during the summer of 2010 working in the search infrastructure group
with my mentor Alexander
Yip.
During summer of 2009, I investigated balanced systems
within the MapReduce framework of Hadoop.
Goals consisted of analyzing cluster resources during a MapReduce job,
identifying bottlenecks, and classifying various types of jobs according to
these bottlenecks with the hope of being able to utilize cluster resources more
efficiently.