Robert Grossman
Selected Technical Talks
2008
-
Knowledge Discovery from Distributed Data: Lessons
from the Teraflow Testbed, National Science Foundation,
April 8, 2008, Arlington, Virginia.
2007
-
An Introduction to Data Mining on Grids,
Midwest Grid Workshop, Chicago, March 25, 2007.
-
Hopf Algebras of Labeled Trees
and Some Associated Differential Algebra Structures,
Second International Workshop on
Differential Algebra and Related Topics,
Rutgers University, Newark, New Jersey, April 13, 2007.
-
Modeling Highly Large, Heterogeneous Data Sets: Towards a Billion
Models, DIMACS Workshop on Recent Advances in Mathematics and
Information Sciences for Analysis and Understanding of Massive and
Diverse Sources of Data, Rutgers University, New Brunswick,
May 15, 2007.
-
Unique Keys for Chemical Compounds and Metabolic Pathways,
Interface 2007, Philadelphia, May 25, 2007.
-
Building Statistical Models on Large and Distributed Data,
Analytical Computing Forum, June 28, 2007, Austin, Texas.
-
Data Driven Discovery in E-Science,
Interdisciplinary Strategic Issues in e-Science and
Cyber-Infrastructure, Caltech, June 13, 2007.
-
Detecting Changes in Large Data Sets of Payment Card Data: A Case
Study, Thirteenth ACM SIGKDD International Conference on Knowledge
Discovery and Data Mining, San Jose, CA, August 14, 2007.
-
Distributed Discovery in E-Science: Lessons from the Angle Project,
National Science Foundation Workshop on Next Generation Data Mining
(NGDM '07), Baltimore, Maryland, October 12, 2007.
-
Sector: A Peer-to-Peer Infrastructure for Distributing Large
Scientific Data Sets Over Wide Area High-Performance Networks,
GridNets 2007, Lyon, France, October 17, 2007
-
Angle: Detecting Anomalies and Emergent Behavior
from Distributed Data in Near Real Time, SC 07, Reno, NV,
November 13, 2007.
-
Data Grids, Data Clouds and Data Webs: A Survey of
High Performance and Distributed Data Mining , Workshop on Hardware and
Software for Large-Scale Biological Computing in the Next Decade.
Okinawa, Japan, December 12, 2007.
2006
-
Other People's Petabytes:
The Challenge of Distributed Data Mining and Distributed Data Integration,
Salishan High Speed Computing Conference, April 26, 2006, Salishan, Oregon.
-
Multiscale Analysis Of Data: Clusters, Outliers and Noise - Preliminary Results,
Second NASA Data Mining Workshop: Issues and Applications in Earth Science,
Pasadena, May 24, 2006.
-
Change Detection using Cubes of Models (CDCM),
Interface 2006, 38th Symposium on the interface of statistics,
computing science, and applications, Pasadena, California, May 26,
2006.
-
The Age of Data-Driven Discovery and Decision Support: The New Rules,
The First Vyborny Memorial Lecture, University of Chicago,
July 17, 2006.
-
Using Term Lists and Inverted Files to Improve Search Speed for
Metabolic Pathway Databases, 3rd International Workshop on Data
Integration in the Life Sciences 2006 (DILS'06), July 21, 2006.
-
Sector - An eScience Platform for Distributing
Large Scientific Data Sets, eScience Workshop,
October 13, 2006, Baltimore, Maryland.
-
DataSpace - Data Integration Using Universal Keys,
Workshop on Information Integration, Philadelphia,
October 26, 2006.
-
Transporting the Sloan Digital Sky Surey Using Sector,
SC 06, November 14, 2006.
-
Distributing the Sloan Digital Sky Survey Using UDT and Sector,
Second IEEE International Conference on
e-Science and Grid Computing, Amsterdam, December 4, 2006.
2005
-
Yunhong Gu and Robert L. Grossman, Optimizing UDP-based Protocol
Implementations, Third International Workshop on Protocols for Fast
Long-Distance Networks Lyon, France, February 4, 2005 (presentation
by Michal Sabala).
-
The UDT Project and the Teraflow Testbed,
4th Annual ON*VECTOR International Photonics Workshop,
La Jolla, March 1, 2005.
-
Biowebs and Biogrids of Proteomics Data, Panel Presentation,
Workshop on Proteomics and Informatics sponsored by the Chicago Biomedical
Consortium, Northwestern University, April 22, 2005.
-
An Event Based Framework for Improving Information Quality
That Integrates Baseline Models, Causal Models and Formal Reference
Models, Second International ACM SIGMOD Workshop on Information
Quality in Information Systems, Baltimore, June 17, 2005.
-
Assigning Unique Keys to Chemical Compounds for data
integration: some interesting counterexamples, 2nd International
Workshop on Data Integration in the Life Sciences, University of
California, San Diego, July 22, 2005.
-
High Performance Analytics: Why do Network Protocols and Light Paths Matter?,
iGrid 2005, San Diego, California, September 26, 2005.
-
A Tutorial Introduction to High Performance Analytics,
SC 05, Seattle, November 14, 2005.
-
Real Time Change Detection and Alerts from Highway Traffic Data,
SC 05, Seattle, November 15, 2005.
-
The Teraflow Challenge: High Performance Mining of Streaming Data,
SC 05, Seattle, November 15, 2005.
-
Master Works Talk: Data Mining Challenges: Technical, Pragmatic and Strategic,
SC 05, Seattle, November 16, 2005.
2004
-
UDT: An Application Level Transport Protocol for Grid Computing,
Second International Workshop on Protocols for Fast Long-Distance Networks, PFLDnet 2004,
Argonne National Laboratory, Argonne, Illinois, February 17, 2004.
-
Tera Mining: A Testbed for Distributed Data Mining over
High Performance SONET and Lambda Networks,
NSF Shared Cyberinfrastructure (SCI) Meeting,
Arlington, Virginia, February 19, 2004.
-
Biowebs, UT-ORNL Bioinformatics Summit 2004,
Fall Creek Falls State Park, Pikesville, TN,
March 27, 2004.
-
Using DataSpace Archives to Support Long Term Stewardship
of Remote and Distributed Data,
NASA/IEEE Conference on Mass Storage Systems and Technologies
(MSST2004), College Park, Maryland, USA, April 14, 2004.
-
Open DMIX: High Performance Web Services for Distributed
Data Mining, 2004 SIAM International Conference on
Data Mining (SDM 2004) Workshop on High Performance and Distributed Data Mining,
Orlando, April 24, 2004.
-
Distributed Alert Management Systems,
Rutgers - CIMIC Workshop on Securing Critical Infrastructure
and Resources Protection, Rutgers University, Newark, NJ, June 24, 2004.
-
Some Hopf Algebras of Trees and their Applications,
BIRS Workshop on Combinatorial Hopf Algebras, Banf, British Columbia,
August 28 - September 2, 2004.
-
Highly Scalable, UDT-Based Network Transport Protocols for
Lambda and 10 GE Routed Network,
DOE Office of Science High-Performance Network Research Workshop Ultranet 2004,
Fermi National Laboratory, Bativia, Illinois, September 15, 2004.
-
Unique chemical keys for biomolecules and integration of
distributed data, Symposium on Computational Science of Biomolecules:
Applications in Medicine and Therapeutics, University of Illinois at
Chicago, October 8, 2004.
-
Experiences in the Design and Implementation of a High Performance Transport Protocol,
SC 04, Pittsburgh, November 9, 2004. (Presentation partly by Yunhong Gu.)
2003
- Biowebs, Data mining for the Americas, NSF AMPATH Workshop:
Fostering Collaborations and Next Generation Infrastructure, Florida
International University, Miami, January 29, 2003.
- The OptIPuter Data Stack, OptIPuter Project Meeting,
San Diego, February 6, 2003.
- Data Grids and Beyond,
Global Grid Forum/Internet Society Master Class,
University of Amsterdam, March 25, 2003.
- Global Access to Large Distributed Data Sets using Photonic
Data Services, 20th IEEE Symposium on Mass Storage Systems,
San Diego, April 8, 2003.
- High Performance Data Transport Protocols
employing UDP-based Data Channels and TCP-based Control
Channels, DOE Workshop on Ultra High-Speed Transport Protocols and Network
Provisiong for Large-Science Applications,
Argonne National Laboratory, April 10, 2003.
- Some Case Studies For Alert Management Systems,
DARPA Workshop, BBN, Cambridge, May 27, 2003.
- Beyond Data Grids: Photonic Data Services on Lambda
Grids, Global Grid Forum Plenary Panel Presentation,
Seattle, June 25, 2003.
- High Performance File Transfer Protocols and Congestion
Control Mechanisms Using SABUL, Internet2 Techs
Workshop, Lawrence, Kansas, August 5, 2003.
- Experimental Studies of the Universal Chemical Key (UCK)
Algorithm on the NCI Database of Chemical Compounds Abstract,
The Computational Systems Bioinformatics Conference (CSB),
Stanford, August 12, 2003.
-
Virtual Joins Using Universal Keys: Towards
Data Integration Services in Data Mining Middleware,
Workshop on Data Mining and Exploration Middleware for Distributed and Grid Computing,
Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN,
September 18, 2003.
-
The SABUL Application Library for High Performance Data Transport:
How to Move Very Large Data Sets Over Very Long Distances -
Using Today's Network Infrastructure,
Cern Computing Seminar, Geneva, Switzerland, October 1, 2003.
- Open DMIX - Data Integration and Exploration Services for
Data Grids, Data Web and Knowledge Grid Applications, First
International Workshop on Knowledge Grid and Grid Intelligence
(KGGI 2003), Halifax, Canada, October 13, 2003.
-
Beyond Data Grids: Data Webs, Lambda Grids, and All That,
NASA Information Science and Technology Colloquium Series,
NASA Goddard, November 12, 2003.
-
A Tutorial Introduction to High Performance Data Transport,
Bill Allcock (Argonne National Laboratory), Robert Grossman (University of Illinois at Chicago and
Open Data Partners), and Steven Wallace (Indiana University), SC 03, Phoenix, November 16, 2003.
-
Project DataSpace Bandwidth Challenge Presentation, SC 03, Phoenix, November 18, 2003.
-
HPC Challenge Presentations,
Using Virtual Joins in DataSpace to Mine and Visualize Distributed Data,
SC 03, Phoenix, November 19, 2003.
2002
- Analyzing Remote Data and Mining Distributed Data Using
Data Webs, Department of Computing Seminar,
Imperial College, London, March 22, 2002.
-
Analyzing Remote Data and Mining Distributed Data Using Data Webs
Departmental Colloquia,
Computer Science Department, Indiana University,
April 3, 2002.
-
Analyzing Remote Data and Mining Distributed Data Using Data Webs,
Indiana Pervasive Computing Research Initiative Colloquium,
Indiana University Purdue University at Indiana (IUPUI),
April 4, 2002.
-
Combining Families of Information Retrieval Algorithms Using Meta-Learning,
Second SIAM International Workshop on Text Mining, Arlington, VA,
April 13, 2002.
- Finding Bad Guys in Distributed Streaming Data Sets, Panel
Presentation on Resource and Location Aware Data Mining, Second SIAM
International Workshop on High Performance Data Mining, Arlington, VA,
April 13, 2002.
- Why Neural Networks Won't
Catch Bad Guys, The 2002 Workshop on Information and Data Management IDM 2002
Arlington, Virginia, May 6, 2002.
- An Introduction to High Performance Data Mining
for Homeland Defense, CCR-P/DIMACS Conference
on Mining Massive Data Sets and Streams: Mathematical Methods and
Algorithms for Homeland Defense, Princeton, New Jersey, June 17, 2002.
- The Freeing of Biological Data: From Biological Databases to
Biogrids and Biowebs, Chicago Community Trust Symposium, Chicago,
September 3, 2002.
- DataSpace (demonstration), IGrid 2002, Amsterdam, The
Netherlands, September 25, 2002.
- Photonic Data Services: Integrating Path, Network,
and Data Services, to Support Next Generation Data Mining
Applications, NSF Workshop on Next Generation Data Mining
Applications, Baltimore, Maryland, November 2, 2002.
- High Performance Data Webs (demonstration), SC 02, Baltimore,
Maryland, November 18, 2002.
- High Performance Data Webs on the Terra Wide Data Mining Testbed
(via video), CANARIE Advanced Networks Workshop, Montreal, Quebec,
November 20, 2002.
- High Performance Computing Challenge: Data Exploration on the
Terra Wide Data Mining Testbed, SC 02, Baltimore, Maryland, November 20,
2002.
- Merging Multiple Data Streams on Common Keys
over High Performance Networks, SC 02, Baltimore, Maryland,
November 21, 2002.
- Data Mining and Cyber Threat Analysis: Three Trends,
Workshop on Data Mining for Cyber Threat Analysis, IEEE
International Conference on Data Mining, December 9, 2002
Maebashi City, Japan.
- An Algebraic Approach to Data Mining: Some Examples, IEEE
International Conference on Data Mining, December 10, 2002 Maebashi
City, Japan.
2001
- Project DataSpace: An Infrastructure Supporting Real Time
Analysis and Decision Making with Complex, Distributed Data,
Multi-Sector Crisis Management Consortium (MSCMC),
Alliance Center for Collaboration Education,
Science and Software (ACCESS), Arlington, Virgina,
March 14, 2001.
- Can Data Mining Ever be a Gigabit Application?,
Lessons from DataSpace,
Salishan Conference on High Speed Computing, Glen Eden, Oregon, April 25, 2001.
- Mining Distributed Exabytes of Data, Alliance
All-Hands Meeting, National Center for Supercomputing
Applications, Champaign-Urbanna, Illionis, May 24, 2001.
- Tera-Mining, Star Tap Meeting, INET 2001,
Stockholm, Sweden, June 5, 2001
- Steps Toward Real Time Data Mining,
ICSA 2001, Applied Statistics Symposium, June 7-9, 2001, Chicago.
- An Introduction to PMML, Second Annual Workshop on the Predictive
Model Markup Language, August 26, 2001, San Francisco, California.
- The Data Challenge, Testimony before the NSF Blue Ribbon Advisory Committee
for Cyberinfrastructure, November 29, 2001, Arlington, Virgina.
2000
- Terabyte Challenge 2000: Project DataSpace, Asian Pacific Advanced
Network Workshop, Tsukuba Science City, Japan, February 15, 2000.
- The Terabyte Challenge 2000/Project DataSpace, Workshop on
Scientific Data Management, Minnesota High Performance Computing Center,
July 20, 2000.
- The Terabyte Challenge 2000/Project DataSpace, NASA Ames,
August 14, 2000.
- Distributed and Parallel Data Mining: Advances and Future Directions,
Distributed and High Performance Knowledge Discovery 2000 (DPKD-2000),
ACM Knowledge Discovery in Databases (KDD) 2000 Conference,
Boston, August 20, 2000.
- A Framework for Distributed Data Mining Strategies that are
Intermediate Between Centalized Strategies and In-Place Strategies,
Distributed and High Performance Knowledge Discovery 2000 (DPKD-2000),
ACM Knowledge Discovery in Databases (KDD) 2000 Conference,
Boston, August 20, 2000.
- Introduction to PMML, PMML Workshop, ACM Knowledge Discovery in
Databases (KDD) 2000 Conference, Boston, August 23, 2000.
- A Tutorial on High Performance Data Mining,
Supercomputing 2000 (SC2000), Dallas, November 5, 2000.
- PSockets: The Case for Application-level Network Striping
for Data Intensive Applications using High Speed Wide Area
Networks, Supercomputing 2000 (SC2000), Dallas, November 8, 2000.
1999
- The Inevitable Emergence of Data Mining, Chicago Chapter of the
American Statistical Association, East Bank Club, January 12, 1999.
- Terabyte Challenge 20000, Abilene Launch Event, Washington, D.C.,
February 24, 1999.
- The Terabyte Challenge: A Testbed for High Performance and
Distributed Data Mining, Post-vBNS NSF-CRA Invitational Workshop, La
Jolla, March 1, 1999.
- The Predictive Model Mark up Language (PMML), R. L. Grossman and
M. Cornelson (presentation by M. Cornelson) 1999 AFCEA Federal Data
Mining Symposium and Exposition, Tysons Corner, Virginia, March 9-10, 1999.
- Mining Collection of Trajectories, SIAM 1999 International Conference
on Parallel Processing (ICPP), San Antonio, March 23, 1999.
- Data Mining: Issues and Challenges in Wide Area
Distributed Data Mining, KDD 99 Workshop on High Performance Data
Mining, San Diego, August 15, 1999.
- A High Performance Implementation of the Data Space Transfer
Protocol (DSTP), KDD 1999 Workshop on High Performance Data Mining, San
Diego, August 15, 1999.
- Terabyte Challenge 2000: Project DataSpace, AHPRC Workshop,
Minneapolis, September 9, 1999.
- Terabyte Challenge 2000: Project DataSpace, National Science
Foundation, Arlington, VA, November 9, 1999.
- A Tutorial on High Performance Data Mining,
Supercomputing 1999, Portland, November 15, 1999.
- Papyrus: A System for Data Mining over Local and Wide Area Clusters
and Super-Clusters, Supercomputing 1999, Portland, November 16, 1999.
- Terabyte Challenge 2000: Project DataSpace, SuperComputing 99
Conference - High Performance Computing Challenge, Portland, November
17, 1999.
1998
- Combing Data Mining and Predictive Modeling, Advanced Information
Processing and Analysis Steering Group (AIPA 98) Conference , Tysons
Corner, Virgina, March 17-18, 1998.
- A Tutorial Introduction to High Performance Data Mining, Sixth
NASA Goddard Space Flight Center Conference on Mass Storage and
Technologies and Fifteenth IEEE Symposium on Mass Storage Systems,
College Park, Maryland, March 23, 1998.
- Scaling Tree-based Classifiers, Second Pacific-Asia Conference on
Knowledge Discovery and Data Mining, Melbourne, April 12-19, 1998.
- Scaling Tree-based Classifiers, DIMACS Workshop on High
Performance Data Mining, Princeton, April 26-28, 1998.
- The Inevitable Emergence of Data Mining, UCAID/Internet 2 Quality
of Service Workshop, Santa Clara, May 20-22, 1998.
- The Inevitable Emergence of Data Mining, Army Research Center
(ARL) and US Army Test and Evaluation Command (TECOM) Workshop on
Computing, Aberdeen, Maryland, August 19-21, 1998.
- A Tutorial Introduction to High Performance Data Mining, AAAI
1998 Conference on Knowledge Discovery and Data Mining (KDD-98), New
York City, August 27-31, 1998.
- Data Mining on Clusters, Super-Clusters, and Meta-Clusters, 1998
Asian Conference on High Performance Computing, Singapore, September
22-27, 1998.
- Data Mining on Clusters, Super-Clusters, and Meta-Clusters, RCI
Conference on High Performance Computing, Pentagon City, October 14, 1998.
- The Terabyte Challenge: A Testbed for High Performance and
Distributed Data Mining, Research Demonstration, 1998 Supercomputing
Conference (SC-98), Orlando, November 8 - November 12, 1998.
- The Terabyte Challenge: A Testbed for High Performance and
Distributed Data Mining, Research Demonstration, IBM CASCON Conference,
Toronto, November 30 - December 3, 1998.
- The Terabyte Challenge: A Global Testbed for High Performance and
Distributed Data Mining, 2nd annual CA*net Workshop, Ottawa, December
15-16, 1998.
1997
- High Performance Data Mining, Tandem Computer, Austin, Texas,
January 31, 1997.
- Four one hour talks on data mining and related topics: 1) An
Overview to Data Mining, Data Warehousing and Intelligent Agents, 2)
An Introduction to Data Mining, 3) An Introduction to High
Performance Data Warehouses, 4) Integrated Architectures for Data
Mining, Department of Defense, Fort Meade, Maryland, March 6, 1997.
- Detecting Network Intrusions Using Data Mining, Sixth Annual
Symposium on Advanced Information Processing and Analysis, Tysons
Corner, Virginia, March 26, 1997.
- The Old Order Changeth Yielding Place to the New: The Rise of High
Performance Data Management and the Demise of High Performance
Computing, Pittsburgh Supercomputing Center, March 29, 1997.
- The Data Mining and Analysis of Packet Data for Detecting Network
Intrusion, Eleventh International Conference on Mathematical Modeling
and Scientific Computing, Washington, DC, April 1, 1997.
- Data Mining in Financial Services,
Financial Services Technology Consortium (FSTC)
General Meeting, Orlando, FL, April 17, 1997.
- Using Data Mining to Detect Network Intrusion,
Practical Applications of Data Mining and Knowledge Discovery
PADD 97, London, England, April 25, 1997.
- An Introduction to Data Mining, Center for Communication Research,
Princeton, NJ, May 1, 1997.
- High Performance Data Mining: A Tutorial Introduction, First
European Symposium, Principles of Data Mining and Knowledge Discovery,
PKDD 97, Trondheim, Norway, June 25-27, 1997.
- Dynamic Similarity: Mining Collections of Trajectory Segments,
NSF Workshop on the Mathematics of Mining Massive Data, Chicago, Illinois
July 12-15, 1997.
- JTool: Accessing Warehoused Collections of Objects with
Java, Persistent Java Workshop 2, Half Moon Bay, California,
August 13-15, 1997.
- An Introduction to Data Mining,
NASA-CESDIS Workshop on Data Mining and Data Warehousing,
Greenbelt, Maryland, August 19-21, 1997.
- Detecting Network Intrusions Using Data Mining,
Seminar at Boeing Research, September 2, 1997.
- Data Minining Scientific and Engineering Data, NSF Mathematical
and Physical Sciences Distinguished Lecturer, Arlington, Virgina,
September 30, 1997.
- High Performance Data Mining, CASCON 97, Toronto, Ontario,
November 11, 1997.
- High Performance Data Mining: A Tutorial Introduction,
Supercomputing 97, San Jose, California, November 16, 1997.
- The Terabyte Challenge: An Open, Distributed Testbed
for Managing and Mining Massive Data Sets, Supercomputing 96,
Pittsburgh, November 19, 1997.
- The Terabyte Challenge: An Open, Distributed Testbed for Managing and
Mining Massive Data Sets, Supercomputing 96, Pittsburgh, November 19,
1997.
1996
- The Symbolic Computation of Differential Invariants
Using Trees, Closing session of the special year on Computational
Differential Algebra and Algebraic Geometry, City College of
New York, January 5, 1996.
- An Informal Introduction to Mathematics of
Data Mining, La Jolla, February 14, 1996.
- The Old Order Changeth: The Rise of High Performance
Data Management in Scientific Computing, colloquium at the Pittsburgh
Supercomputing Center, March 29, 1996.
- Computing Differential Invariants with Trees, Special
Session on Differential Algebra at the AMS Regional Meeting,
New York City, New York, April 13, 1996.
- Data Mining, Object Warehouses, and Persistent Object Managers, Seventh
International Conference on Persistent Object Systems, Cape May, New
Jersey, May 30, 1996.
- Optimization Driven Data Mining and Object Warehouses, SIGMOD 96
Workshop on Data Mining, Monreal Quebec, June 2, 1996.
- Data Minng Challenges for Digital Libraries and Electronic Commerce,
MIT-ACM 50th Anniversity of the ACM, Boston, June 14, 1996.
- Accessing Warehoused Collections of Objects Through Java,
First International Conference on Persistent Java, Glascow, Scottland,
September 17, 1996.
- Discovering Critical Patterns in Large Data Sets, NSF Workshop on
Data Mining, Arlington, Virginia, September 25, 1996.
- Mode Shifting, Mode Sharing, and Mode Superposition,
Hybrid Systems IV (HSAC 96), Ithaca, New York, October 14, 1996.
- Data Mapping Support for Data Mining Applications, NASA-CESDIS
Workshop on Data Mapping, CESDIS, Greenbelt, Maryland, November 7, 1996.
- Managing, Mining, Querying and Analyzing Very Large Data Sets,
CASCON 96, Toronto, November 13, 1996.
- High Performance Data Mining, Australian National University,
December 17, 1996.
This is from www.rgrossman.com