Soumyadeb Mitra

 
144 S. 3rd Street, #335.
San Jose, CA 9511




About Me

I graduated with a PhD from the Department of Computer Science at the University of Illinois at Urbana-Champaign. My advisor was Prof. Marianne Winslett. Earlier, I received my undergraduate degree in Computer Science and Engineering from the Indian Institute of Technology (IIT), Delhi.

Right now, I am working on my stealth-mode startup, Credool. Prior to this, I was the tech lead for the DD-Archiver product line at Data Domain/EMC.

PhD Thesis (pdf)

CV (txt/pdf)


PhD Research

My research focused on "compliance records": records such as business communications, financial statements and medical images, which are increasingly being stored in electronic form. Ensuring that such records are not only readily accessible and accurate, but also credible and irrefutable, is particularly imperative given recent legal and regulatory trends (Sarbanes-Oxley Act, SEC Rule 17a-3/4, HIPAA, DOD 5015.2). In my PhD, I developed techniques for the secure creation, maintenance, retrieval, migration and eventual shredding of such compliance records.

Apart from this, I have also worked on Maitri, a data-management system for scientific data, and LBIO, a user-space parallel I/O routine for cluster computers.


Awards

  • ACM SIGMOD Jim Gray Doctoral Dissertation Award - honorable mention, 2009.
  • IBM PhD Fellowship, 2008.
  • Best paper award in VLDB, 2006.
  • Best paper award in Storage Security and Survivability Workshop, 2006.
  • Institute silver medal of IIT Delhi (for securing 1st position in the dual-degree batch), 2002.
  • CISCO fellowship at IIT Delhi, 2001-2002.
  • 1st rank in predefined software design contest in Tryst (IIT Delhi's technical festival), 1999.
  • 3rd rank in the Regional Mathematical Olympiad in Delhi, 1997.

Publications

2008

  • An Architecture for Regulatory Compliant Database Management.
    Soumyadeb Mitra, Marianne Winslett, Richard Snodgrass, Shashank Yaduvanshi and Sumedh Ambokar. ICDE 2009
  • Query-based Partitioning of Documents and Indexes for Information Lifecycle Management.
    Soumyadeb Mitra, Marianne Winslett and Windsor Hsu. SIGMOD 2008
  • Deleting Index Entries from Compliance Storage
    Soumyadeb Mitra, Marianne Winslett and Nikita Borisov. Extending Database Technology (EDBT) 2008

2007

  • Trustworthy Migration and Retrieval of Regulatory Compliant Records. Soumyadeb Mitra, Marianne Winslett, Windsor H Hsu, Xiaonan Ma. IEEE Conference on Mass Storage Systems and Technologies (MSST) 2007
  • Trustworthy Keyword Search for Compliance Storage. Soumyadeb Mitra, Marianne Winslett, Windsor H Hsu, Kevin C.-C. Chang. In The International Journal on Very Large Data Bases, 2007.

2006

  • Trustworthy Keyword Search for Regulatory Compliant Record Retention. Soumyadeb Mitra, Windsor H. Hsu and Marianne Winslett. VLDB 2006. Best Paper Award.
  • Secure Deletion from Inverted Indexes on Compliance Storage. Soumyadeb Mitra and Marianne Winslett. Storage Security Workshop 06, in conjunction with CCS06. Best Paper Award.
  • Bitmap Indexes for large Scientific Data Sets: A case study. Rishi Rakesh Sinha, Soumyadeb Mitra, Marianne Winslett. IPDPS, 2006

2005

  • An Efficient, Non Intrusive, Log Based I/O Mechanism for Scientific Simulations on Clusters. Soumyadeb Mitra, Rishi R Sinha, Marianne Winslett, Xiangmin Jiao. Cluster 2005, Boston.
  • Maitri: A Format independent Data Management System for Scientific Data. Rishi Rakesh Sinha, Soumyadeb Mitra, Marianne Winslett. SNAPI workshop at PACT, 2005.

Hobbies and Interests

I have a keen interest in outdoor sports like soccer, running and biking. Recently, I completed the North Shore Century, a 100-mile biking event in Evanston, IL, in 7 hrs 49 mins. Earlier, I ran the Chicago Marathon in 2005.

Travelling is my other big hobby. Although I haven't travelled much in the US (thanks to the PhD workload), I have toured a lot in India. Here is a list of Indian states I have visited, where visiting is defined as spending at least one night, not counting overnight train journeys.

Some Quotes