Robert Grossman
Publications - By Topic
Data Mining
-
Robert L Grossman and Yunhong Gu,
Data Mining Using High Performance Clouds:
Experimental Studies Using Sector and Sphere,
Proceedings of The 14th ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, ACM, 2008.
Draft
-
Joseph Bugajski, Chris Curry, Robert L. Grossman, David Locke, Steve
Vejcik, Detecting Changes in Large Data Sets of Payment Card Data: A
Case Study, Proceedings of The Thirteenth ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, ACM, 2007.
Draft
-
Joseph Bugajski and Robert L. Grossman, An Alert Management Approach
to Data Quality: Lessons Learned from the Visa Data Authority Program,
Proceedings of the 12th International Conference on Information
Quality, (ICIQ 2007).
Draft
-
Joseph Bugajski, Chris Curry, Robert L. Grossman, David Locke and
Steve Vejcik, Data Quality Models for High Volume Transaction Streams:
A Case Study, Proceedings of the Second Workshop on Data Mining Case
Studies and Success Stories, ACM 2007.
Draft
-
Robert L. Grossman, A Review of Some Analytic Architectures for High
Volume Transaction Systems, The 5th International Workshop on Data
Mining Standards, Services and Platforms (DM-SSP '07), ACM, 2007,
pages 23-28.
Draft
-
Chetan Gupta and Robert L. Grossman, Outlier Detection with Streaming
Dyadic Decomposition, Proceedings of the 7th Industrial Conference on
Data Mining, LNCS Volume 4597, Springer-Verlag, 2007, pages 77-91.
Draft
-
Robert L. Grossman, Fifth International Workshop on Data Mining
Standards, Services, and Platforms, Preface, Proceedings of The
Thirteenth ACM SIGKDD International Conference on Knowledge Discovery
and Data Mining, ACM, 2007.
Draft
-
Leland Wilkinson, Anushka Anand and Robert L Grossman, High-dimensional Visual
Analytics: Interactive Exploration Guided by Pairwise Views of Point
Distribution, IEEE Transactions on Visualization and
Computer Graphics, Volume 12, Number 6, pages 1363-1372, 2006.
Draft
-
Robert L. Grossman, Yunhong Gu, David Handley, and Michal Sabala
Joe Mambretti, Alex Szalay and Ani Thakar,
Kazumi Kumazoe and Oie Yuji,
Minsun Lee, Yoonjoo Kwon, and Woojin Seok, Data Mining Middleware for
Wide Area High Performance Networks,
Journal of Future Generation Computer Systems (FGCS), Volume 22, Number 8, pages 940-948, 2006.
Draft
-
Yong Mao, Yunhong Gu, Jia Chen and Robert L. Grossman,
SDCS: Simplified Data Communications in Parallel/Distributed Applications,
IEEE International Symposium on Cluster Computing and the Grid (CCGrid06),
pages 292-295, 2006.
Draft
-
Greeshma Neglur, Robert L. Grossman, Natalia Maltsev, and Clement Yu,
Using Term Lists and Inverted Files to Improve Search Speed for
Metabolic Pathway Databases, 3rd International Workshop on Data
Integration in the Life Sciences 2006 (DILS'06), Lecture Notes in
Bioinformatics, Volume 4075, Springer-Verlag, Berlin, 2006, pages
168-184.
Draft
-
Joseph Bugajski, Robert L. Grossman, Eric Sumner and Steve Vejcik,
Monitoring Data Quality for Very High Volume Transaction Systems,
Proceedings of the 11th International Conference on Information Quality,
2006.
Draft
-
David Ferrucci, Robert L. Grossman, Anthony Levas,
PMML and UIMA Based Frameworks For Deploying Analytic Applications nd Services,
Proceedings of the 4th International Workshop on Data Mining Standards,
Services and Platforms (DM-SSP 06), ACM, New York, 2006, pages 14-26.
Draft
-
Robert L. Grossman, Yunhong Gu, Michal Sabala,
and Joel J. Mambretti,
Real Time, Distributed Detection of Anomalies
and Emergent Behavior Using the Angle Algorithm,
University of Illinois at Chicago, Laboratory
for Advanced Computing, Technical Report, 2006.
Draft
-
Greeshma Neglur and Robert L. Grossman,
Assigning Unique Keys to Chemical Compounds for Data Integration: Some
Interesting Counter Examples, 2nd International Workshop on
Data Integration in the Life Sciences (DILS 2005),
La Jolla, July 20-22, 2005.
Draft
-
Joseph Bugajski, Robert L. Grossman, Eric Sumner and Tao Zhang,
An Event Based Framework for Improving Information Quality That Integrates
Baseline Models, Causal Models and Formal Reference Models,
Second International ACM SIGMOD Workshop
on Information Quality in Information Systems (IQIS 2005),
June 17th, Baltimore, Maryland,
co-located with ACM SIGMOD/PODS 2005.
Draft
-
Robert L. Grossman, Michal Sabala,
Javid Alimohideen, Anushka Aanand, John Chaves, John Dillenburg,
Steve Eick, Jason Leigh,
Peter Nelson, Mike Papka, Doug Rorem, Rick Stevens,
Steve Vejcik, Leland Wilkinson, and Pei Zhang,
Real Time Change Detection and Alerts from Highway Traffic Data,
ACM/IEEE International Conference for High Performance Computing and Communications (SC '05).
Draft
-
Yunhong Gu and Robert Grossman,
Supporting Configurable Congestion Control in Data Transport Services,
ACM/IEEE International Conference for High Performance Computing and Communications (SC '05).
Draft
-
Joseph Bugajski, Robert Grossman, Eric Sumner,
Tao Zhang, A Methodology for Establishing Information Quality
Baselines for Complex, Distributed Systems, 10th International
Conference on Information Quality (ICIQ), 2005.
Draft
-
L. Wilkinson, A. Anand and R. Grossman, Graph-theoretic scagnostics,
Proceedings of the IEEE Information Visualization 2005
(INFOVIS'05), pages 157-164.
Draft
-
Rajmonda Sulo, Stephen Eick, Robert Grossman,
DaVis: A tool for Visualizing Data Quality,
Proceedings of the IEEE Information Visualization 2005
(INFOVIS'05).
Draft
-
Bing Liu, Robert L. Grossman and Yanhong Zhai,
Mining Web Pages for Data Records,
IEEE Intelligent Systems, November/December, 2004, pages 49-55.
Draft
-
Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong, Dave
Lillethun, Jorge Levera, Joe Mambretti, Marco Mazzucco, and Jeremy
Weinberger, Photonic Data Services: Integrating Path, Network and Data
Services to Support Next Generation Data Mining Applications,
Data Mining: Next Generation Challenges and Future Directions,
H. Kargupta, A. Joshi, K. Sivakumar, and Y. Yesha, editors,
AAAI Press, 2004.
Draft
-
Robert L. Grossman,
Alert Management Systems: A Quick Introduction,
in Managing Cyber Threats: Issues, Approaches and Challenges,
edited by Vipin Kumar, Jaideep Srivastava and Aleksandar Lazarevic,
Springer Science+Business Media, Inc., New York, 2005, pages 281-291,
ISBN 0-387-24226-0.
Draft
-
Chetan Gupta and Robert L. Grossman,
GenIc: A Single Pass Generalized Incremental Algorithm for Clustering,
2004 SIAM International Conference on Data Mining (SDM 04),
to appear.
Draft
-
Robert L. Grossman, Yunhong Gu, Chetan Gupta, David Hanley,
Xinwei Hong, and Parthasarathy Krishnaswamy,
Open DMIX: High Performance Web Services for Distributed Data Mining,
7th International Workshop on High Performance and Distributed Mining,
in association with the
Fourth International SIAM Conference on Data Mining, 2004.
Draft
-
Jorge Levera, Benjamin Barin, and Robert Grossman,
Experimental Studies Using Median Polish Procedures to
Reduce Alarm Rates in Data Cubes of Intrusion Data,
Intelligence and Security Informatics for National and Homeland
Security, Hsinchun Chen, Reagan Moore, Daniel Zeng, John Jeavitt, editors,
LNCS 3073, Springer Verlag, New York, 2004, pages 482-491.
Draft
-
Robert L. Grossman, Dave Hanley, Xinwei Hong and Parthasarathy
Krishnaswamy, Using DataSpace to Support Long-Term Stewardship of
Remote and Distributed Data, NASA/IEEE MSST 2004, 12th NASA
Goddard/21st IEEE Conference on Mass Storage Systems and
Technologies, 2004, pages 239-244.
Draft
-
Andrei L. Turinsky and Robert L. Grossman, A Greedy Algorithm for
Selecting Models in Ensembles,
Proceedings 4th IEEE International Conference Data Mining (ICDM 2004),
Brighton, UK, pages 547-550, IEEE Computer Society Press, 2004.
Draft
-
Parthasarathy Krishnaswamy, Stephen G. Eick, Robert L Grossman,
Visual Browsing of Remote and Distributed Data,
IEEE Symposium on Information Visualization (INFOVIS'04), 2004,
page 12.
Draft
-
Robert L. Grossman, Pavan Kasturi, Donald Hamelberg, Bing Liu,
An Empirical Study of the Universal Chemical Key Algorithm
for Assigning Unique Keys to Chemical Compounds,
Journal of Bioinformatics and Computational Biology, 2004,
Volume 2, Number 1, 2004, pages 155-171.
Draft
-
M. Cornelson, E. Greengrass, R. L. Grossman, R. Karidi, and D.
Shnidman, Combining Information Retrieval Algorithms Using Machine
Learning, Survey of Text Mining: Clustering, Classification, and Retrieval
Michael W. Berry, editor, Springer-Verlag, 2003, pages 159-169.
Draft
-
Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong,
Dave Lillethun, Jorge Levera, Joe Mambretti, Marco Mazzucco, and
Jeremy Weinberger, Global Access to Large Distributed Data Sets
using Photonic Data Services, Proceedings of the 20th IEEE/11th
NASA Goddard Conference on Mass Storage Systems and Technologies
(MSST 2003), IEEE Computer Society, Los Alamitos, California,
pages 62-66.
Draft
-
Bing Liu, Robert L. Grossman and Yanhong Zhai,
Mining Data Records in Web Pages,
Proceedings of The Ninth ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining (KDD 03), pages 601-606.
Draft
-
KDD-2003 Workshop on Data Mining Standards, Services,
and Platforms (DM-SSP 03), ACM SIGKDD Explorations,
Volume 5, Issue 2, page 197, 2003.
Draft
-
R. L. Grossman, S. Mehta and X. Qin,
Path planning by querying persistent stores of
trajectory segments, Laboratory for Advanced Computing
Technical Report Number LAC 93-R3, September, 1992.
Draft
-
Robert Grossman, and Marco Mazzucco, DataSpace - A Web
Infrastructure for the Exploratory Analysis and Mining of Data,
IEEE Computing in Science and Engineering, July/August, 2002,
pages 44-51.
Draft
-
Robert Grossman, Mark Hornick, and Gregor Meyer, Data Mining
Standards Initiatives, Communications of the ACM, Volume 45-8, 2002,
pages 59-61.
Draft
-
R. L. Grossman and Yike Guo,
Parallel Methods for Scaling Data Mining Algorithms to Large Data
Sets, Hanndbook on Data Mining and Knowledge Discovery,
Jan M Zytkow, editor, Oxford University Press, 2002, pages 433 - 442.
Draft
-
R. L. Grossman and R. Hollebeek, The National Scalable Cluster
Project: Three Lessons about High Performance Data Mining and Data
Intensive Computing, in Handbook of Massive Data Sets, J. Abello, P. M.
Pardalos, and M. G. C. Resende, editors, Kluwer Academic
Publishers, 2002.
Draft
-
Marco Mazzucco, Asvin Ananthanarayan, Robert L. Grossman, Jorge
Levera, and Gokulnath Bhagavantha Rao, Merging Multiple Data Streams on
Common Keys over High Performance Networks,
Proceedings of the IEEE/ACM SC2002 Conference,
2002, IEEE Computer Society, page 67.
Draft
-
R. L. Grossman and R. G. Larson, An Algebraic Approach to Data
Mining: Some Examples, Proceedings of the 2002 IEEE International
Conference on Data Mining, IEEE Computer Society, Los Alamitos,
California, 2002, pages 613-616.
Draft
-
A DataSpace Infrastructure for Astronomical Data,
Robert Grossman, Emory Creel, Marco Mazzucco, Roy Williams
in R. L. Grossman, C. Kamath, W. Philip Kegelmeye,
V. Kumar, and R. Namburu, Data Mining for Scientific and
Engineering Applications, Kluwer Academic Publishers, 2001,
pages 115-123.
Draft
-
R. L. Grossman, S. Bailey, A. Ramu, B. Malhi and A. Turinsky,
The Preliminary Design of Papyrus: A System for High Performance,
Distributed Data Mining over Clusters, in Advances in Distributed and
Parallel Knowledge Discovery, H. Kargupta and P. Chan, editors, AAAI
Press/The MIT Press, Menlo Park, California, 2000, pages 259-275.
Draft
-
S. Bailey, E. Creel, R. Grossman, S. Gutti, and H. Sivakumar, A
High Performance Implementation of the Data Space Transfer Protocol
(DSTP), Large-Scale Parallel Data Mining, M. J. Zaki and C.-T. Ho,
editors, Springer-Verlag, Berlin, 2000, pages 55-64.
Draft
-
N. Sawant, C. Scharver, J. Leigh, A Johnson, G. Reinhart, E. Creel,
S. Batchu, S. Bailey, R. L. Grossman, The Tele-Immersive Data Explorer: A
Distributed Architecture for Collaborative Interactive Visualization of
Large Data-sets, 4th International Immersive Projection Technology
Workshop, Ames, Iowa, June 19-20, 2000.
Draft
-
Robert Grossman, Mark Hornick, and Gregor Meyer, Emerging
Standards and Interfaces in Data Mining, Handbook of Data Mining,
Nong Ye, editor, Lawrence Erlbaum Associates, Publishers,
Mahwah, New Jersey, 2003, pages 453-459.
Draft
-
R. L. Grossman, S. Bailey, A. Ramu and B. Malhi,
P. Hallstrom, I. Pulleyn and X. Qin, The Management and Mining
of Multiple Predictive Models Using the Predictive Model Markup
Language (PMML), Information and Software Technology,
Volume 41, 1999, pages 589-595.
Draft
-
R. L. Grossman, S. Bailey, A. Ramu, B. Malhi and H. Sivakumar, A.
Turinsky, Papyrus: A System for Data Mining over Local and Wide Area
Clusters and Super-Clusters, Proceedings of Supercomputing 1999, IEEE.
Draft
-
R. L. Grossman, The Role of QoS in Wide Area Data Mining,
Proceedings of the First Internet 2 Joint Applications Engineering QoS
Workshop: Enabling Advanced Applications Through QoS, UCAID, 1999, pages
19-21.
Draft
-
J. Leigh, A. Johnson, T. DeFanti, S. Bailey, R. L. Grossman,
A Methodology for Supporting Collaborative Exploratory Analysis of Massive
Data Sets in Tele-Immersive Environments, 8th IEEE International Symposium
on High Performance and Distributed Computing, Redundo Beach, California,
Aug 3-6, 1999.
Draft
-
J. Leigh, A. Johnson, T. DeFanti, S. Bailey, R. L. Grossman, A
Tele-Immersive Environment for Collaborative Exploratory Analysis of
Massive Data Sets, ASCI 99, pages 3-9, Heijen, the Netherlands, 1999.
Draft
-
R. L. Grossman and S. Bailey, An Overview of Dynamic
Classification: Mining Collections of Trajectories (invited paper), 1998
Proceedings of the Section on Physical and Engineering Sciences,
American Statistical Association, Alexandria, Virgina, pages 24-28.
Draft
-
Robert Grossman, Simon Kasif, Reagan Moore, David Rocke, and
Jeff Ullman, Data Mining Research: Opportunities and Challenges.
A Report of three NSF Workshops on Mining Large, Massive, and
Distributed Data, http://www.ncdm.uic.edu/m3d2.htm, 1998.
Draft
-
R. L. Grossman, Data Mining Challenges for Digital Libraries,
ACM Computing Surveys, Volume 28A (electronic), December, 1996.
Draft
-
R. L. Grossman, S. Bailey and D. Hanley, Data Mining Using Light
Weight Object Management in Clustered Computing Environments,
Proceedings of the Seventh International Workshop on Persistent Object
Stores, Morgan-Kauffmann, San Mateo, 1997, pages 237-249.
Draft
-
Haim Bodek, Robert Lee Grossman and Ivan Pulleyn,
Detecting Network Intrusions through the
Data Mining of Network Packet Data
Using the ACT Algorithm, 1997.
Draft
-
R. L. Grossman and H. V. Poor,
Optimization Driven Data Mining and Credit Scoring,
in Proceedings of the IEEE/IAFE 1996 Conference
on Computational Intelligence for Financial Engineering
(CIFEr), IEEE, Piscataway, 1996, pages 104-110.
Draft
-
R. L. Grossman, H. Bodek, D. Northcutt, and H. V. Poor, Data
Mining and Tree-based Optimization, Proceedings of the Second
International Conference on Knowledge Discovery and Data Mining, E.
Simoudis, J. Han and U. Fayyad, editors, AAAI Press, Menlo Park,
California, 1996, pp 323-326.
Draft
-
R. L. Grossman, The Terabyte Challenge: An Open, Distributed Testbed
for Managing and Mining Massive Data Sets, Proceedings of the 1996
Conference on Supercomputing, IEEE, 1996.
Draft
-
Robert L. Grossman and Dave Northcutt,
A Note on Interfacing Object Warehouses and Mass Storage Systems
for Data Mining Applications,
Proceedings of the Goddard Conference on Mass Storage Systems, 1996.
-
D. R. Quarrie, C. T. Day, S. Loken, J. F. Macfarlane,
D. Lifka, E. Lusk, D. Malon, E. May, L. E. Price, L. Cormell,
A. Gauthier, P. Liebold, J. Hilgart, D. Liu, J. Marstaller, U.
Nixdorf, T. Song, R. Grossman, X. Qin, D. Valsamis, M. Wu, W.
Xu, A. Baden, The PASS Project: A Progress Report, Proceedings
of the Conference on Computing in High Energy Physics 1994, edited
by S. C. Loken, pages 229-232, 1995.
-
D. R. Quarrie, C. T. Day, S. Loken, J. F. Macfarlane,
D. Lifka, E. Lusk, D. Malon, E. May, L. E. Price, L. Cormell,
A. Gauthier, P. Liebold, J. Hilgart, D. Liu, J. Marstaller, U.
Nixdorf, T. Song, R. Grossman, X. Qin, D. Valsamis, M. Wu, W.
Xu, A. Baden, The PASS Project Architectural Model, Proceedings
of the Conference on Computing in High Energy Physics 1994, edited
by S. C. Loken, pages 233-235, 1995.
-
E. N. May, D. Lifka, D. Malon, L. E. Price L.
Cormell, A. Gauthier, J. Marsteller, S. Mestad, U. Nixdorf R.
Grossman, X. Qin, D. Valsamis, M. Wu, W. Xu A Demonstration
of a Multi-level Object Store and its Application to the Analysis
of High Energy Physics Data, Proceedings of the Conference
on Computing in High Energy Physics 1994, edited by S. C. Loken,
pages 236-238, 1995.
-
D. Malon, D. Lifka, E. May R. Grossman, X. Qin,
W. Xu Parallel Query Processing for Event Store Data, Proceedings
of the Conference on Computing in High Energy Physics 1994, edited
by S. C. Loken, pp. 239-240, 1995.
Draft
-
R. L. Grossman, N. Araujo, X. Qin, and W. Xu,
Managing physical folios of objects between nodes,
Persistent Object Systems (Proceedings of the Sixth International
Workshop on Persistent Object Systems), M. P. Atkinson, V.
Benzaken and D. Maier, editors, Springer-Verlag and British
Computer Society, 1995, pages 217-231.
-
R. L. Grossman, A. Nerode, and W. Kohn, Nonlinear
Systems, Automata, and Agents: Managing their Symbolic Data Using
Light Weight Persistent Object Managers, International Symposium
on Fifth Generation Computer Systems, 1994: Workshop on Heterogeneous
Cooperative Knowledge-Bases, Kazumasa Yokota, editor, ICOT, pages
65-74.
-
N. Araujo, R. Grossman, D. Hanley, W. Xu,
S. Ahn, K. Denisenko, M. Fischler, M. Galli D. Malon and
E. May, Some Remarks on Parallel Data Mining Using a Persistent
Object Manager, Proceedings of the Conference on Computing
in High Energy Physics 1995.
Draft
-
S. Bailey, R. Grossman, and D. Hanley, D.
Benton and B. Hollebeek, Scalable Digital Libraries of Event
Data and the NSCP Meta-Cluster, Proceedings of the Conference
on Computing in High Energy Physics 1995.
Draft
-
R. L. Grossman, X. Qin, D. Valsamis, W. Xu, C.
T. Day, S. Loken, J. F. MacFarlane, D. Quarrie, E. May, D. Lifka,
D. Malon, L. Price, Analyzing High Energy Physics Data Using
Databases: A Case Study, Proceedings of the Seventh International
Working Conference on Scientific and Statistical Database Management,
IEEE Press, 1994, pages 283-286.
Draft
-
R. Grossman, Querying databases of trajectories
of differential equations II: index functions, Fourth NASA
Workshop on Computational Control of Flexible Aerospace Systems,
NASA Conference Proceedings, Number 10065, Part 1, L. W. Taylor,
Jr., editor, NASA Langley Research Center, 1991, pp. 35-39.
Draft
-
A. Baden and R. Grossman, Database computing
and high energy physics, Computing in High-Energy Physics 1991,
edited by Y. Watase and F. Abe, Universal Academy Press, Inc.,
Tokyo, 1991, pp. 59-66.
-
R. Grossman, Querying databases of trajectories
of differential equations I: data structures for trajectories,
Proceedings of the 23rd Hawaii International Conference on Systems
Sciences, IEEE, 1990, pages 18-23.
Draft
-
Andrew Baden and Robert L. Grossman, A Model for Computing
at the SCC, SSC Technical Report, June 6, 1990.
Draft
This is from www.rgrossman.com