Maurizio Rafanelli, Istituto di Analisi dei Sistemi ed Informatica – C. N. R., Italy
IDEA GROUP PUBLISHING
Acquisition Editor: Mehdi Khosrow-Pour
Senior Managing Editor: Jan Travers
Managing Editor Amanda Appicello
Development Editor: Michele Rossi
Copy Editor: Maria Boyer
Typesetter: Amanda Lutz
Cover Design: Weston Pritts
Printed at: Integrated Book Technology
Published in the United States of America by
Idea Group Publishing (an imprint of Idea Group Inc.)
701 E. Chocolate Avenue, Suite 200
Hershey PA 17033
Web site: http://www.idea-group.com
and in the United Kingdom by
Idea Group Publishing (an imprint of Idea Group Inc.)
3 Henrietta Street
London WC2E 8LU
Tel: 44 20 7240 0856
Fax: 44 20 7379 3313
Web site: http://www.eurospan.co.uk
Copyright © 2003 Idea Group Inc.
All rights reserved. No part of this book may be reproduced in any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher.
Library of Congress Cataloging-in-Publication Data
Rafanelli, M. (Maurizio)
Multidimensional databases : problems and solutions / Maurizio Rafanelli.
(hard cover) -- ISBN 1-59140-086-4 (ebook)
1. Multidimensional databases. I. Title.
QA76.9.D3 R219 2003
British Cataloguing in Publication Data
A Cataloguing in Publication record for this book is available from the British Library.
About the Authors
Maurizio Rafanelli is senior scientist at the Istituto di Analisi dei Sistemi ed Informatica "A. Ruberti" and in charge of the research area, "Information Systems and Knowledge Bases for Complex Information Structures." He is author and co-author of many international publications (journals, conferences, books, etc.). He was a part-time professor at the University of Roma-La Sapienza (Roma-1) and at the University of Roma-Tor Vergata (Roma-2) from 1981 to 1997. He was general chairman at the 4th International Conference on Statistical and Scientific Database Management (Proceedings, Lecture Notes in Computer Science, No. 339, Springer-Verlag) in 1988 and general chairman at the 10th International Conference on Scientific and Statistical Database Management (Proceedings, IEEE Publications) in 1998. He has been a program committee member of various international conferences and cooperates with many journals as a reviewer. He is author of the entry, "Data Models in Statistical and Scientific Databases," in the Encyclopedia of Computer Science and Technology (Kent & Williams, Exec. Eds.).
Amr El Abbadi received his PhD in Computer Science from Cornell University. Since 1987 he is a professor in the Department of Computer Science at the University of California, Santa Barbara. Between 1990 and 1999 he held visiting positions at various institutions, including the University of Campinas in Brazil, IBM Almaden Research Center, the Swedish Institute of Computer Science in Stockholm, and at IRISA at the University of Rennes in France. He is currently the area editor for Information Systems: An International Journal, an editor of Information Processing Letters (IPL), and associate editor of the Bulletin of the Technical Committee on Data Engineering. He was vice chair of the 1999 International Conference on Distributed Computing Systems, vice chair for the International Conference on Data Engineering 2002, and the Americas program chair for the 2000 International Conference on Very Large Data Bases. Dr. El Abbadi's main research interests are in the area of distributed information management systems, including databases, digital libraries, and data management of moving objects.
Divyakant Agrawal is a professor of Computer Science at the University of California, Santa Barbara. His research interests include distributed systems, computer networks, databases, large-scale information systems and digital libraries, and data management of moving objects. He received his BE from Birla Institute of Technology and Science, Pilani, India, and his MS and PhD in Computer Science from the State University of New York at Stony Brook. Professor Agrawal is the editor for Distributed and Parallel Databases, An International Journal and was a member of the program committee for various international conferences, including ACM SIGMOD International Conference on Management of Data, ACM Symposium on Principles of Database Systems (PODS), and International Conference on Very Large Data Bases (VLDB). He is also a member of the ACM and the IEEE Computer Society.
Elena Baralis is an associate professor at Politecnico di Torino. She obtained a PhD in Information and Systems Engineering from Politecnico di Torino in 1994. Her research activity is focused on active database systems, data warehousing, and data mining. She teaches courses on database design and technology.
Andrea Calí received a Laurea (MS) in Electrical Engineering from the University of Roma-La Sapienza, and now he is a third-year PhD student in Computer Engineering at the Department of Computer and System Sciences of the University of Roma-La Sapienza. His research interests include information integration, data warehousing, database models, knowledge representation. He is involved in several national and international research projects. His latest publications deal with investigating the expressiveness of data integration systems and managing integrity constraints in information integration.
Curtis E. Dyreson is an Assistant Professor of Computer Science at Washington State University. Prior to working in academia, he consulted in industry for Citibank and Chemical Bank, UK. He has held faculty positions in Australia at James Cook University and Bond University, and was a visiting associate professor at Aalborg University, Denmark. He is an author or coauthor of numerous papers on temporal and multidimensional databases. Currently, he is the ACM TODS information director.
Christian S. Jensen is a professor of Computer Science at Aalborg University, Denmark, and an honorary professor at Cardiff University. He has authored or coauthored numerous scientific papers on database semantics, modeling, and performance. He is on the editorial boards of ACM TODS and IEEE Data Engineering Bulletin. He was recently co-program committee chair for the Workshop on Spatio-Temporal Database Management, held with VLDB'99, and for the 8th International Symposium on Spatial and Temporal Databases; also he was program committee chair for the 2002 EDBT Conference. He serves on the boards of directors for a small number of companies.
Domenico Lembo is a second-year PhD student at the Department of Computer and System Sciences of University of Rome-La Sapienza, Italy. His research concerns database models, information integration and data warehousing, logic programming, and knowledge representation. He is currently involved in various Italian and European research projects that aim at providing innovative methodologies and advanced reasoning capabilities for information integration and data warehousing. Recently, he co-authored several papers on modeling and semantics of data integration systems.
Maurizio Lenzerini, born in Pavia, has serves as full professor in Computer Science and Engineering since 1990. He is the author of several academic books on fundamentals of computer science, software engineering, and database design. Since 1983, he has been carrying out his research activity at the University of Rome-La Sapienza, where he is leading a research group on databases and artificial intelligence. His main research interests are oriented towards conceptual and semantic data modeling, data integration, data warehousing, semistructured data management, knowledge representation and reasoning, and object-oriented methodologies. He is currently involved in national and international research projects on data integration, data warehousing, and semi-structured data. He is the author of more than 200 publications in international conferences and journals, including the most prestigious ones in the above-mentioned areas, such as Journal of Computer and System Science, Information and Computation, Artificial Intelligence, Information Systems, IEEE Data and Knowledge Engineering, ACM-PODS, ACM-SIGMOD, IEEE-ICDE, VLDB, ICDT, IJCAI, AAAI, KR, and CoopIS. He is the editor of several international books, including a recent one on Data Warehouse Quality. He is regularly a member of the program committee of the most important international conferences in the above areas, including IJCAI, AAAI, EDBT, PODS, KR, CoopIS, ER, and ICDT. He organized several international conferences and workshops. He is a member of the editorial board of various international journals, and is the editor of Information Systems: An International Journal, for the area of data modeling, knowledge representation, and reasoning. He was program co-chair of the 4th International Conference on Cooperative Information Systems, held in Edinburgh in 1999. He was the conference chair of the International Conference on Conceptual Modeling, held in Paris in 1999. He will serve as the program chair of ICDT 2003.
Francesco M. Malvestuto was born in Sulmona, Italy. He earned his degrees from the University of Rome-La Sapienza, including one in Physics in 1974, a master's in Informatics in 1978, and a master's in System Engineering in 1980. His research interests evolved from statistical databases to uncertainty management in artificial intelligence, and axiomatizion of the probability-theoretic notions of independence and irrelevance. His current research fields include uncertainty management, database theory, computational statistics, and graph theory. He is a professor of Computer Science at the University of Rome-La Sapienza, and is a member of the IASC and SIS.
Alberto Mendelzon was born in Buenos Aires, and received his MA, MSE, and PhD degrees from Princeton University. He spent a post-doctoral year at the IBM T.J. Watson Research Center in 1979-1980 and has been with the University of Toronto since 1980. He has been a visiting scientist at the IBM Centre for Advanced Studies, AT&T Bell Laboratories, NTT Basic Research Labs in Musashino, Japan, and the IASI in Rome. His research interests are in databases and knowledge bases, including database design theory, query languages, database visualization, query processing, Web-based information systems, and data warehousing. He has been associate and acting chair of the Computer Systems Research Institute and chaired or co-chaired the program committees of the major database conferences. He was a guest editor for the Journal of Computer and System Sciences and the VLDB Journal, and is on the Editorial Board of the Journal of Digital Libraries, World Wide Web Journal, ACM Digital Reviews, and Theory and Practice of Object Systems. He is the current information director for the ACM Special Interest Group on Management of Data (SIGMOD).
Marina Moscarini was born in Roma. She earned her degree in Mathematics from the University of Rome-La Sapienza in 1973. Since 1993 she is a professor of Computer Science at the University of Rome-La Sapienza, where she currently heads the Computer Science Department. Prior to joining the University of Rome-La Sapienza, she was a full-time researcher at IASI-CNR at Rome until 1989, and from 1989 to 1993 she was professor at the University of Rome-Tor Vergata. Her research interests evolved from the study of classical difficult (NP-complete) graph theoretic problems on particular classes of graphs, to the study of problems related to database design, query optimization, and statistical database security.
Stefano Paraboschi is an associate professor at the Dipartimento di Elettronica e Informazione of Politecnico di Milano. He received the Laurea Degree in Ingegneria Elettronica in 1990, and a PhD in Ingegneria Informatica in 1994, both from Politecnico di Milano. His main research interests are in the area of databases, with a focus on active databases, data warehouses, and the construction of data-intensive websites. He is the author, together with Paolo Atzeni, Stefano Ceri, and Riccardo Torlone, of the book, Database Systems: Concepts, Languages and Architectures (McGraw-Hill, 1999).
Torben B. Pedersen is an associate professor of Computer Science at Aalborg University, Denmark. Previously, he has worked as a visiting scientist at Lawrence Berkeley National Laboratory and as a database specialist at the largest Danish software company, Kommunedata. He has authored or co-authored papers on multidimensional data modeling and semantics, OLAP performance issues, OLAP data integration, and clickstream analysis. He is on the program committees for the EDBT 2002 conference and the DMDW 2002 workshop, and has reviewed papers for SIGMOD, VLDB, ICDE, CAiSE, SSDBM, and KAIS. He has extensive industry collaboration within the business intelligence industry.
Elaheh Pourabbas received her master's degree in Engineering from the University of Roma-La Sapienza in 1992, and her PhD in Bioengineering from the University of Bologna in 1997. She is currently a researcher at the Istituto di Analisi dei Sistemi ed Informatica "Antonio Ruberti" of the Italian National Research Council. Dr. Pourabbas' interests include database theory, data models, data warehousing and OLAP, spatio-temporal databases, and database systems.
Mirek Riedewald is a PhD candidate in the Computer Science Department at the University of California, Santa Barbara. His dissertation research focuses on supporting aggregation and summarization in data warehouses and digital libraries. He was due to graduate in the summer of 2002 and join Cornell University as a research associate. His research interests include database and information systems, especially On-Line Analytical Processing (OLAP), digital libraries, data streams, and distributed systems. His work has been published in the proceedings of prestigious scientific conferences such as the International Conference on Very Large Data Bases (VLDB) and the ACM SIGMOD International Conference on Management of Data.
Riccardo Rosati is assistant professor in Computer Science and Engineering at Università di Roma-La Sapienza, Italy. His main research interests are oriented towards artificial intelligence and databases, in particular knowledge representation and reasoning, planning and cognitive robotics, information integration, and data warehousing. He is currently involved in national and international research projects on information integration. He is the author of more than 50 publications in international conferences and journals, such as Artificial Intelligence, ACM Transactions on Computational Logic, JAIR, IJCAI, AAAI, KR, and CoopIS.
Giuseppe Sindoni has been an applicative researcher at ISTAT for four years. He holds a PhD in Information Systems Engineering and an Honours Degree in Electronic Engineering. He also served as associate professor of Informatics at the Faculty of Economics, Roma Tre University for the 2001-2002 academic year. Before joining ISTAT, he was a scientific collaborator and assistant professor at the Department of Computer Science and Automation, Roma Tre University, and scientific collaborator of the Istituto per l'Analisi dei Sistemi e l'Informatica (IASI) of the Italian National Research Council (CNR). His expertise covers the following areas: integration of (statistical) information systems; Web data quality; databases; (statistical) databases and the Web, and geographical information systems. He has published more than 20 papers in these fields, mainly in refereed journals and international conference proceedings. Additionally, he has 15 years' experience as a freelance consultant to several national and international organizations.
Arie Shoshani is a senior staff scientist at Lawrence Berkeley National Laboratory; he joined LBNL in 1976. He heads the Scientific Data Management Group. He received his PhD from Princeton University in 1969, and from 1969 to 1976, he was a researcher at System Development Corporation, where he worked on the Network Control Program for the ARPAnet, distributed databases, database conversion, and natural language interfaces for data management. His current areas of work include data models, query languages, temporal data, statistical and scientific database management, storage management on tertiary storage, and grid storage middleware. Dr. Shoshani is also the director of a Scientific Data Management (SDM) Integrated Software Infrastructure Center (ISIC), one of seven centers (budget, $3 million) selected by the SciDAC program at DOE in 2001. In this capacity, he is coordinating the work of collaborators from four DOE laboratories and four universities (see http://sdmcenter.lbl.gov). Dr. Shoshani has published more than 60 technical papers in refereed journals and conferences; chaired several workshops, conferences, and panels in database management; and served on numerous program committees for various database conferences. He also served as an associate editor for the ACM Transactions on Database Systems. He was elected a member of the VLDB Endowment Board, served as the publication board chairperson for the VLDB Journal, and as the vice-president of the VLDB Endowment. His homepage is http://www.lbl.gov/~arie.
Ernest Teniente is an associate professor in the Software Department at the Universitat Politècnica de Catalunya in Barcelona. He teaches courses on software engineering and database technology. His research interests include data warehousing, database updating and conceptual modeling of information systems. His e-mail address is email@example.com.
Leonardo Tininini is a researcher at the Italian National Institute of Statistics (ISTAT). He has written several scientific papers on statistical databases, aggregate data, query languages, spatio-temporal databases, and been referee for important international conferences and journals. He is lecturer at the University of Rome-Campus Bio-Medico. He also collaborates with the Institute of Analysis on Systems and Computer Science of the Italian Research National Council, and with the French research institute INRIA. He is leading a project for the dissemination of the Italian Census 2001 data through a Web-based data warehouse.
Riccardo Torlone is a professor of Computer Science at the Università Roma Tre, Italy. In the past, he had teaching and research appointments at IASI-CNR and at University of Roma-La Sapienza. He has authored or co-authored numerous scientific papers on database theory, database design, active databases, object-oriented query languages, and data warehousing. He has also co-authored the book, Database Systems: Concepts, Languages and Architectures (McGraw Hill). He served on the program committee of several international conferences on database technology and information systems.
Alejandro Vaisman was born in Buenos Aires. He is a civil engineer and computer scientist, and holds a PhD in Computer Science from the University of Buenos Aires. He was a visiting researcher at the University of Toronto, Canada, and invited lecturer at the Polytechnic University of Madrid. He has authored and co-authored several scientific papers presented in major database conferences like ICDE and VLDB. His research interests are in relational and deductive databases, OLAP and data warehousing, temporal databases, data mining, and Web-based information systems. He has worked in design and operation of database systems, and he is currently vice-dean at the University of Belgrano, Argentina.