Robert Grossman's Home Page
This web site contains some technical publications, talks, and FAQs about computing with data written by Robert Grossman. There are over 100 publications available online.
There is also a list of publications by topics and a list of technical reports.
The site was last updated on March 24, 2007.
Recent News
- Sector vs Hadoop. In a recent paper, we describe the design and architecture of Sector. The paper also describes some preliminary experimental studies comparing the performance of Sector and Hadoop. On the clusters and distributed clusters used, Sector was about twice as fast as Hadoop on the Terasort Benchmark. Sector is designed to be used on clusters within a data center, as well as on distributed clusters across data centers that are connected by wide area area high performance 10 Gbps networks. The paper can be found here.
- Sector Version 1.5 Released. Version 1.5 of Sector was released on March 18, 2008. It can be obtained from Source Forge at the project site sector.sf.net. Sector is a wide area high performance storage and compute cloud. For the past couple of years, Sector has been used to distribute the Sloan Digital Sky Survey (SDSS) via the web site sdss.ncdm.uic.edu The current version of Sector also includes high performance distributed computing services.
- UDT, Version 4 released. UDT is an application layer high performance network transport protocol that is available from Source Forge at udt.sf.net. Version 4 of UDT was recently released.
- UDT will be part of Globus. Beginning with Globus Version 4.2, one can choose an option in GridFTP so that TCP is replaced with UDT, which will speed up large data transfers.
- Recent Award. On November 15, 2007, The Angle Project won First Place in the 2007 Analytics Challenge at the ACM/IEEE International Conference for High Performance Computing and Communications 2007 (SC07). The title of the project was "Angle: Detecting Anomalies and Emergent Behavior from Distributed Data in Near Real Time."
- Recent Award. In July 2007, I was awarded the ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD) Service Award for my "... role in the development of open and scalable architectures and standards for the SIGKDD and Global KDD Communities."
- Recent Award. The paper "Data Quality Models for High Volume Transaction Streams: A Case Study" by Joesph Bugajski, Robert Grossman, Chris Curry, David Locke and Steve Vejcik won the second annual Data Mining Practice Prize at KDD 2007. The prize is awarded each year "for work that has had a significant and quantitative impact in the application in which it was applied."
- New book. I have just finished writing a book called Digital Beauty. There is a more information here.
- More news. For more news, see the News section.
About the Author
Robert Grossman is the Managing Partner of Open Data Group, helping companies increase revenues and decrease costs through a better understanding of their data. Please contact him at info at opendatagroup dot com if you would like to work with him.
He is also the Director of the Laboratory for Advanced Computing (LAC) and the National Center for Data Mining (NCDM) at the University of Illinois at Chicago. Please contact info at lac.uic.edu if you would like more information about the Laboratory or Center.
Biographical material can be found here.
Finding Material on this Site
You can use Google to search for a particular term on this site by entering the term below: