IDEA GROUP PUBLISHING
Acquisition Editor: Mehdi Khosrow-Pour
Published in the United States of America by
and in the United Kingdom by
Copyright 2003 Idea Group Inc.
All rights reserved. No part of this book may be reproduced in any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher.
Library of Congress Cataloging-in-Publication Data
Wang, John, 1955-
1. Data mining. I. Title.
British Cataloguing in Publication Data
About the Authors
John Wang is a professor in the Department of Information and Decision Sciences at Montclair State University (MSU), USA. Having received a scholarship award, he came to the USA and completed his Ph.D. in Operations Research from Temple University in 1990. He worked as an assistant professor at Beijing University of Sciences & Technology, China, for two years. In the fall of 1992, he transferred to MSU. Dr. Wang received his tenure in 1997 and was promoted to full professor in 2000 for his outstanding and extraordinary contributions. Dr. Wang has published 72 papers in refereed journals and conference proceedings, as well as two research books. He has been an active member of five renowned professional organizations. He has served as session chairman and track chair seventeen times on the most prestigious international and national conferences. His research activities and articles have been well received, enabling him to build a reputation with other significant professionals in his field. He was further invited to serve as a referee for Operations Research (a flagship journal) and IEEE Transactions on Control Systems Technology (a very prestigious journal). He has also developed several computer software programs based on his research findings. Dr. Wang is a highly accomplished and well-established operations research scholar. His current research interests include optimization, nonlinear programming, and manufacturing systems engineering. A long-term goal is to study the synergy of operations research and cybernetics.
Stefan Arnborg received his M.Sc.E. and Dr.Tech. in Engineering Physics from the Swedish Royal Institute of Technology in 1968 and 1972, respectively. In 1971, he joined the Swedish National Defense Research Institute and in 1979, Philips Elektronikindustrier AB. Since 1982, he is a professor in Computer Science at the Royal Institute of Technology. His interests are in education, operations research and defense systems, programming systems, algorithms, biomedical informatics, verification, inference, and uncertainty management. He is chairman of the Computer Engineering program at the Royal Institute of Technology and scientific advisor to the Swedish Institute of Computer Science.
Barbro Back is professor in Accounting Information Systems at bo Akademi University in Turku, Finland. Her research interests include accounting information systems, intelligent systems in business, neural networks, and data mining. She has presented her research in the Journal of Management Information Systems, Accounting Management and Information Technology, European Journal of Operations Research, International Journal of Intelligent Systems in Accounting, Management and Tax, Advances in Accounting, and other journals. She currently serves on the editorial boards of The New Review of Applied Expert Systems and Emerging Technologies, and The International Journal of Accounting.
Christian B hm was born in Argentina in 1980. He is pursuing his Information Systems Engineering degree at Universidad Tecnol gica Nacional Facultad Regional Santa Fe, Argentina. He holds a fellowship of UTN-FRSF and has been working for GIDSATD (Group of Research and Development in Decision Support Systems) since 2001. His research interest is related to data mining.
Marvin L. Brown is an assistant professor in the School of Business Administration at Hawaii Pacific University, USA. He holds a B.S. from Shepherd College, an M.B.A. from Morehead State University, and is A.B.D. in Computer Information Systems from Cleveland State University. He has several years of industrial experience in database management and information technology as a consultant to NASA, the U.S. Federal Energy Regulatory Commission, and the U.S. Federal Department of Transportation. He is also a former partner in the consulting firm, Mardale Consulting LLC.
Robin Burke is an associate professor in the Department of Information Systems and Decision Sciences at California State University, Fullerton. He received his Ph.D. in 1993 from the Institute for the Learning Sciences at Northwestern University. In 1996, he was co-founder of the Intelligent Information Laboratory at the University of Chicago. His research is on the application of artificial intelligence to electronic commerce and digital libraries, particularly recommender systems, integrating a variety of natural language, information retrieval, machine learning, and artificial intelligence techniques, with an emphasis on case-based reasoning.
Yao Chen is an assistant professor of Manufacturing and Information Systems in the College of Management at the University of Massachusetts, Lowell, USA. Her current research interests include efficiency and productivity issues of information systems, information technology's impact on operations performance, and methodology development of Data Envelopment Analysis. Her work has appeared in such journals as European Journal of Operational Research, International Journal of Production Economics, Information Technology & Management Journal and others. She is a member of the Institute for Operations Research and Management Sciences (INFORMS).
Omar Chiotti was born in Argentina in 1959. He received his degree in Chemical Engineering in 1984 from Universidad Tecnol gica Nacional and his Ph.D. in Engineering from Universidad Nacional del Litoral, Argentina, in 1989. Since 1984, he has been working for CONICET (Consejo Nacional de Investigaciones Cient ficas y T cnicas), currently as a researcher. He is a professor of Information Systems Engineering at Universidad Tecnol gica Nacional Facultad Regional Santa Fe, Argentina, since 1986. Currently, he teaches Management Systems. He is the director of the GIDSATD (Group of Research and Development in Decision Support Systems) since 1994. His current research interests are focused on decision-support systems engineering.
Jack S. Cook is an associate professor of Information Systems at the Rochester Institute of Technology (RIT), USA. His specialties are Electronic Commerce, Information Systems and Production/Operations Management. Dr. Cook has extensive experience in teaching and training experience in these fields that spans over two decades and includes over 50 conference presentations and numerous journal articles. Dr. Cook is a Certified Fellow in Production and Inventory Management (CFPIM). His education includes a Ph.D. in Business Administration, an M.S. in Computer Science, an M.B.A., an M.A. in Mathematics, and a B.S. in Computer Science.
Laura L. Cook works for the Computing & Information Technology Department at the State University of New York at Geneseo, USA. She is currently a graduate student in Information Technology at the Rochester Institute of Technology. Laura has four journal publications and has given numerous presentations. She also volunteers as the Publicity Director for the Rochester, NY, Chapter of APICS. Laura has taught Electronic Commerce for the Jones School of Business and Computers in Education for the School of Education at SUNY-Geneseo. She has also taught many technology workshops on various topics.
Massimo Coppola (1969) is a Ph.D. candidate in Computer Science at the Department of Computer Science of the University of Pisa, Italy. He received his master's in C.S. in 1997, and works with the Parallel Architecture Research Group. He is a member of the ACM SIG on Knowledge Discovery and Data Mining. His research interests also include parallel architectures and programming languages for parallel computation. The research topic of his Ph.D. studies is the application of structured parallel programming to high-performance data mining, out-of-core and data-intensive algorithms.
Sorin Draghici has obtained his B.Sc. and M.Sc. degrees from Politehnica University in Bucharest, Romania, followed by a Ph.D. degree from the University of St. Andrews (third oldest university in UK after Oxford and Cambridge). He has published over 50 peer-reviewed journal and conference publications, as well as three book chapters. He is co-inventor on four patent applications and a frequent journal reviewer and NSF panelist. Currently, Dr. Draghici is the head of the Intelligent Systems and Bioinformatics in the Department of Computer Science at Wayne State University, USA (http://vortex.cs.wayne.edu/).
Tomas Eklund received an M.Sc. (Econ.) in Information Systems from bo Akademi University, in Turku, Finland, in 2001. He is currently working as a researcher and is pursuing a Ph.D. at the Department of Information Systems at bo Akademi University. His primary research interests include data mining, knowledge discovery, and neural networks.
Farshad Fotouhi received his Ph.D. in Computer Science from Michigan State University in 1988. He joined the faculty of Computer Science at Wayne State University in August 1988, where he is currently an associate professor and associate chair of the department. Dr. Fotouhi's major area of research is databases, including relational, object-oriented, multimedia/hypermedia systems, and data warehousing. He has published over 80 papers in refereed journals and conference proceedings, served as a program committee member of various database-related conferences, and he is currently a member of the Editorial Advisory Board of the Journal of Database Management.
Mar a Rosa Galli was born in Argentina in 1958. She received her degree in Chemical Engineering in 1983 from Universidad Nacional de Mar del Plata, Argentina and a Ph.D. in Chemical Engineering from Universidad Nacional del Litoral, also in Argentina. Since 1984, she has been working for CONICET (Consejo Nacional de Investigaciones Cient ficas y T cnicas), currently as a researcher. She is a professor of Operation Research of Information Systems Engineering at Universidad Tecnol gica Nacional Facultad Regional Santa Fe, Argentina, since 1989 and co-director of GIDSATD (Group of Research and Development in Decision Support Systems) since 1995. Her current research interests are focused on decision-support systems engineering and multiagent systems.
Donald W. Gribbin is an associate professor of Accountancy at Southern Illinois University at Carbondale, USA. He received a Ph.D. in Business Administration from Oklahoma State University. He has published papers in Decision Sciences, Journal of Management Accounting Research, Journal of Business Finance & Accounting, and The British Accounting Review. His research interests include the distributional properties of financial ratios and the modeling of various types of cost data.
Jerzy W. Grzymala-Busse (M.S. in Electrical Engineering, Technical University of Poznan, Poland, 1964; M.S. in Mathematics, University of Wroclaw, Poland, 1967; Ph.D. in Engineering, Technical University of Poznan, Poland, 1969) is a professor of Electrical Engineering and Computer Science at the University of Kansas, USA. His research interests include data mining, knowledge discovery from databases, machine learning, expert systems, reasoning under uncertainty, and rough set theory. He has published three books and over 150 articles in the above areas, mostly in data mining. He has presented numerous invited presentations on international scientific conferences and has served as a session chair and in steering committees, advisory committees, and program committees of various international and national scientific and technical conferences.
Jeffrey Hsu is an assistant professor of Information Systems at the Silberman College of Business Administration, Fairleigh Dickinson University, USA. His research interests include human-computer interaction, e-commerce, groupware, distance learning, and data mining. He is the author of six books and numerous papers and articles, and has professional experience in the IT industry. Hsu holds a Ph.D. in Management Information Systems from Rutgers University, three master's degrees, and several professional certifications. Dr. Hsu is always interested in discussing opportunities for research and other scholarly activities, and can be reached via e-mail at email@example.com.
William H. Hsu received B.S. degrees in Computer Science and Mathematical Sciences and a Master of Science in Engineering in Computer Science from Johns Hopkins University in May 1993. He received his Ph.D. degree from the University of Illinois at Urbana-Champaign in 1998. He is currently an assistant professor of Computing and Information Sciences at Kansas State University, USA, and a visiting research scientist at the National Center for Supercomputing Applications. His research interests include machine learning and data mining, probabilistic reasoning, decision support and sensor fusion in automation, learning spatial and temporal models from data, and soft computing.
Michael Johnson obtained his B.S. in Computer Engineering from the University of California, San Diego, in 1987. After working in industry at GTE Communications and AT&T Bell Labs for four years, he returned to school and obtained his M.S. in Computer Science at Michigan State University in 1993. For the past eight years he has been an assistant professor and head of the Computer Science Department at Madonna University in Livonia, Michigan, USA. Mr. Johnson is currently completing his Ph.D. studies at Wayne State University.
YongSeog Kim received an M.S. degree in 2000 in Computer Science, and a Ph.D. in Business Administration in 2001 from the University of Iowa. He is currently a postdoctoral student in the Management Sciences Department at the University of Iowa, USA. His research interests are in machine learning and data mining including feature selection in supervised and unsupervised learning, evolutionary algorithms, neural networks, and ensemble methods. In particular, he is interested in applying data-mining algorithms to real-world business applications. He is a member of INFORMS, IEEE, ACM, and AAAI.
John F. Kros is an assistant professor in the Decision Sciences Department, School of Business Administration at East Carolina University, USA. He holds a B.B.A. from the University of Texas, Austin, an M.B.A. from Santa Clara University, and a Ph.D. in Systems Engineering from the University of Virginia. He has several years of industrial experience in electronics manufacturing, specifically in operations management, and is a member of INFORMS, DSI, and ASQ.
Vladimir A. Kulyukin is an assistant professor of Computer Science at the School of Computer Science, Telecommunications and Information Systems of Utah State University, USA. He received his Ph.D. in Computer Science from the University of Chicago in 1998. His research interests are information retrieval and robotics.
Eitel J.M. Laur a has worked in the IT arena for over 15 years and has consulted with many multinational corporations across a wide range of industries, advising on such topics as Decision- Support Systems, Business Intelligence, Client/Server Technology and Web-Based Applications. He holds an M.S. in Electrical Engineering from the University of Buenos Aires, Argentina, and an M.B.A. from Universidad del Salvador (USAL), Argentina. A former faculty member at USAL, he is currently a lecturer at the School of Business, University at Albany, while completing his Ph.D. in Information Science. His teaching and research interests cover the fields of Information Technology, Information Decision Systems and Statistical Data Mining.
Chung-Hong Lee received an M.Sc. in Information Technology for Manufacture from the University of Warwick in 1994. He has been awarded a Ph.D. in Computer Science from the University of Manchester in 1997. He currently is an assistant professor at the Chang Jung University, Taiwan. He has worked at Chang Jung University since 1998, prior to which he was a postdoctoral fellow at Institute of Information Science, Academia Sinica, Taiwan. Current research interests include multilingual text mining, automatic ontology acquisition, computational linguistics, and information retrieval. He is both members of ACM and IEEE.
Lori K. Long is a doctoral student in the Department of Management & Information Systems, the College of Business, Kent State University in Kent, Ohio, USA. Her concentration is Human Resources Management, and her current research interests include the role of technology in effective human resources management, e-learning, and computer-based employee assessment.
Filippo Menczer is an assistant professor in the Department of Management Sciences at the University of Iowa, USA. He received a Laurea in Physics from the University of Rome in 1991, and a dual Ph.D. in Computer Science and Cognitive Science from the University of California at San Diego in 1998. Dr. Menczer has been a Fulbright, Rotary Foundation, NATO, and Santa Fe Institute fellow. He pursues interdisciplinary research spanning from ecological theory to distributed information systems; these contribute to artificial life, agent-based computational economics, evolutionary computation, neural networks, machine learning, and adaptive intelligent agents for Web, text, and data mining.
Alan Oppenheim obtained his B.S. in Mechanical Engineering and his M.S. in Industrial Management from the Polytechnic Institute of Brooklyn (now the Polytechnic University), and his Ph.D. in Business Administration from New York University. He is dean of the School of Business and professor of Information and Decision Sciences at Montclair State University, USA. He is an active member of several professional organizations, has consulted on quality and productivity with various firms, and has presented seminars nationwide. He has published many articles, contributed to several texts, and is the co-author of the book Quality Management: Tools and Methods for Improvement published by Richard D. Irwin, which is about to enter its third edition and which has received widespread industrial and academic acceptance.
Eric Paquet has a Ph.D. in Computer Vision from Laval University, and he is currently working with the Visual Information Technology Group at the National Research Council of Canada. He is working in collaboration with various enterprises, research institutes, and universities. He is representative at the Web3D Consortium and MPEG. He is the author of numerous publications and has won many international awards. His research activities include content-based management of multimedia information, virtual environments, collaborative virtual environments, and anthropometric databases.
Fay Cobb Payton is an assistant professor of Information Technology at North Carolina State University, USA. She earned a Ph.D. in Information Systems from Case Western Reserve University. She holds a B.S. in Industrial and Systems Engineering from Georgia Institute of Technology and an M.B.A. in Decision Sciences from Clark Atlanta University. Her research interests and projects include healthcare, information privacy, supply chain management, data warehousing/mining, diversity, and systems implementation. She has published in Communications of the ACM, Health Care Management Review, Information and Management, and International Journal of Technology Management. Dr. Payton is also on the editorial board of IT Professional - an IEEE publication.
Gys le Roux is a lecturer at the Department of Informatics, University of Pretoria, South Africa. He holds an M.Com. in Informatics and a B.Sc. (Hon.) in Computer Science. His research interests include data mining, data warehousing, and data visualization.
Murali S. Shanker is an associate professor in the Department of Management and Information Systems at Kent State University, USA. He received a Ph.D. from the Department of Operations and Management Science, University of Minnesota. His research interests are knowledge management and data mining, simulation, and distributed processing. His papers have appeared in journals such as INFORMS Journal on Computing, IIE Transactions, Omega, and Annals of Operations Research. His current research includes developing tools to optimize web-server performance, and in conducting a statistical analysis of the behavior of neural networks as applied to classification and prediction problems.
W. Nick Street is an assistant professor in the Management Sciences Department at the University of Iowa, USA. He received a Ph.D. in 1994 in Computer Sciences from the University of Wisconsin. His research interests are machine learning and data mining, particularly the use of mathematical optimization in inductive-learning techniques. His recent work has focused on dimensionality reduction (feature selection) in high-dimensional data for both classification and clustering, ensemble prediction methods for massive and streaming data sets, and learning shapes for image segmentation, classification, and retrieval. He has received an NSF CAREER award and an NIH INRSA postdoctoral fellowship.
Giri Kumar Tayi is a professor in the Department of Management Science and Information Systems in the School of Business at the State University of New York at Albany, USA. He obtained his Ph.D. from Carnegie-Mellon University, Pittsburgh, USA. His teaching and research interests are interdisciplinary and cover the fields of Information Systems and Technology, Service and Operations Management, and Operations Research. He has published over 30 scholarly papers in academic and refereed journals and has been the guest editor of several journal special issues on topics such as Data Quality, Operations Management, Communication Networks: Design and Management, and Mobile Computing.
Marvin D. Troutt is a professor in the Department of Management & Information Systems and in the Graduate School of Management at Kent State University, USA. He received a Ph.D. in Mathematical Statistics from the University of Illinois at Chicago. He is an associate editor of Decision Sciences Journal and a Fellow of the Decision Sciences Institute. His publications have appeared in Decision Sciences, Management Science, Journal of the Operational Research Society, European Journal of Operational Research, Operations Research, Decision Support Systems, Naval Research Logistics, Statistics, and others. His current research interests include the statistics of performance data with application to data mining and decision support.
Hannu Vanharanta began his professional career in 1973 as technical assistant at the Turku office of the Finnish Ministry of Trade and Industry. Between 1975 and 1992, he worked for Finnish international engineering companies, i.e., Jaakko P yry, Rintekno, and Ekono as process engineer, section manager, and leading consultant. His doctoral thesis was approved in 1995. In 1995-1996, he was professor in Business Economics in the University of Joensuu. In 1996-1998, he served as purchasing and supply management professor in the Lappeenranta University of Technology. Since 1998, he has been Professor of Industrial Management and Engineering at Pori School of Technology and Economics, Tampere University of Technology, Finland. He is a member of IPSERA and a Research Fellow at the Institute for Advanced Management Systems Research, bo Akademi.
Marco Vanneschi is a full professor in Computer Science at the Department of Computer Science of the University of Pisa, Italy. His research and teaching activity has been developed in the area of high performance computing, systems architecture, and programming models. He has coordinated national and international projects. He has been member of the official Working Group of MURST on High Performance Computing, and of several international committees (IFIP, IEEE, EuroPar, ParCo, international journals and conferences). He is author of more than 150 scientific papers, three books on computer architecture and parallel programming, and he is scientific editor of six international books.
Herna Viktor is an assistant professor at the School of Information Technology and Electrical Engineering (SITE) of the University of Ottawa, Ottawa, Ontario, Canada. She holds a Ph.D. in Computer Science. Her research interests include data mining, multiagent learning systems, and data warehousing. Professor Viktor is also a visiting professor at the Department of Informatics, University of Pretoria, Pretoria, South Africa.
Ari Visa received his M.S. in Computer Science and Technology from Link ping University of Technology, Sweden, in 1981. The Licentiate and the Doctor of Technology degrees in Information Science he received from the Helsinki University of Technology, Finland, in 1988 and 1990, respectively. Since 1996, he has been a professor, first at Lappeenranta University of Technology and from the beginning of 2000 at Tampere University of Technology in Finland. He is currently a professor in Digital Signal Processing at Tampere University of Technology, and a docent in Image Analysis at HUT. His current research interests are multimedia and multimedia systems, adaptive systems, wireless communications, distributed computing, soft computing, computer vision, knowledge mining, and knowledge retrieval. Professor Visa is the former president and vice president of the Pattern Recognition Society of Finland.
Hsin-Chang Yang received his B.S. degree in Computer Engineering in 1988 from the National Chiao-Tung University, Hsin-Chu, Taiwan, and M.S. and Ph.D. degrees in Information Science and Computer Engineering in 1990 and 1996, respectively, from the National Taiwan University, Taipei, Taiwan. Since 1998, Dr. Yang has been an assistant professor of Information Management at Chang Jung University, Tainan, Taiwan. Dr. Yang's research interests include pattern recognition, neural networks, knowledge discovery, and information retrieval. He is a member of both ACM and IEEE.
Aimao Zhang received her Bachelor of Science from Indiana University of Pennsylvania in 1990. She earned her master's of Business Administration in 1991 from Indiana University of Pennsylvania. She was awarded a Doctor of Philosophy in 2001 from Southern Illinois University at Carbondale, with a major in Management Information Systems and a minor in Production/Operations Management. Zhang has broad interests in teaching and research. She has taught and is teaching courses in Information Technology. Zhang's publications include book chapters, papers in refereed journals, and national conferences.
Joe Zhu is an assistant professor of Operations, Department of Management at Worcester Polytechnic Institute, Worcester, MA. Professor Zhu is an expert in methods of performance measurement and his research interests are in the areas of information technology and productivity, and performance evaluation and benchmarking. He has published a book focusing on performance evaluation and benchmarking using Data Envelopment Analysis (DEA) and developed the DEAFrontier software. With more than 150 DEA models, this software can assist decision-makers in benchmarking and analyzing complex operational efficiency issues in manufacturing organizations as well as evaluating processes in banking, retail, franchising, healthcare, e-business, public services and many other industries. His research has appeared in such journals as Management Science, Operations Research, Annals of Operations Research, Journal of Operational Research Society, European Journal of Operational Research, Computer and Operations Research, OMEGA, Socio-Economic Planning Sciences, Journal of Productivity Analysis, INFOR, and Journal of Alternative Investment.
Wojciech Ziarko received an M.Sc. degree in Applied Mathematics from the Faculty of Fundamental Problems of Technology of the Warsaw University of Technology in 1975. In 1980, he received a Ph.D. degree in Computer Science at the Institute of Computer Science of Polish Academy of Sciences. In 1982, he joined the University of Regina, Canada, where he is now a professor in the Computer Science Department. His research interests include knowledge discovery, machine learning, pattern classification, and control algorithm acquisition. These research interests are largely motivated by the theory of rough sets. He is one of the pioneers of this research area.