Anda di halaman 1dari 8

• Example: student data in a university

– student address may be needed for registering, library management, financial


office, grade reporting, ...
– each application separately maintains its data files and programs to manipulate
those files
A Short History of Database Technology – possibly different formats for the same data (e.g., length of names)
– redundant updates (e.g., to change an address)
• Prehistory
– file systems
– hierarchical and network systems
• The revolution: relational database technology Limitations of File-Based Approach
• Post-relational era
• Tendency to separate and isolate logically-related data
– new organizations of data, more complex data
– subsequent data models capture more information: true DB software “knows
– influence of object technology
about” some inter-entity or inter-file relationships
– more complex applications
• Separate files ⇒ redundancy in defining and storing data
– wasted storage space (only a problem for large applications)
– redundant efforts to enter replicated data and maintain its consistency (seri-
ous problem)
1 • Program-data dependence: data definition in application program ⇒ pro-
gram valid for only one DB with a fixed structure
• Labor intensive
– maintenance difficult (duplication, procedurality)
– inter-file links must be coded in programs

3
Traditional File-Based Approach

• Collection of applications that each define and manage their own files • File technology was the first attempt to computerize manual filing systems
• File: collection of records containing logically-related data (Pascal’s files of • It is an obsolete technology, except for applications that do not require DBMS func-
records, C++’s “struct” or “class” declarations, COBOL’s Data Division) tions. Even for those problems, when efficiency is not crucial and size is not too big,
it is often worth considering simple database software (like, e.g., ACCESS).
• Tight integration between program and data (files)
• Each enterprise function was responsible for its own data and accessed it with their
– physical storage structure visible in application code (e.g., ISAM, VSAM)
own programs
– run-time performance can (and must) be programmed
• File-based approach required much work from application programmers

History of Database Management- 1 History of Database Management- 2


Hierarchical and Network Systems

• Software systems without underlying organized discipline


• Lack of data independence Relational Model
– traditional view of data structures as records (on disk storage) + links: mix-
ture of physical and logical worlds • Defined in 1970 (T. Codd)
– user view dependent on physical details irrelevant for information needs • About 10 years of hesitation
– modern view: links = semantic relationships + physical access paths
• Robust RDBMSs built in the 80’s
• Navigational programming
• Historical compromise between scientists, practitioners, vendors
– all DB accesses imply procedural programming
• Most systems sold in the early 90’s are relational
– programming = “navigation” governed by physical access paths =⇒ long and
complex programs
– oriented towards one-record-at-a-time navigational programming ⇒ complex,
necessarily-manual performance optimization

4 5

• Pre-relational (or pre-historical) data management, unsystematic, adhoc


• Hierarchical systems (e.g., IMS, System 2000)

• CODASYL (network) database systems (e.g., IDS, IDMS, TOTAL, ADABAS, PHO- Novelty and Importance of the Relational Model
LAS)

• Example of data dependence: physical location of fields within records visible in pro- • Simplicity of basic concepts
grams
• Simple theoretical framework
• About pointers:
• Emergence of notion of data model
– pointers provide more explicit integration links (hierarchy, network) than rela-
tions • Data independence: separate universe of application from that of implemen-
tation
– confusion between logical and physical pointers was the problem: the relational
model established and demonstrated the virtues of data independence • Emergence of disciplined DB design
– somewhat cleaner pointers came back with object technology
• Validation of a declarative, high-level approach to data management
• Construction of powerful DBMS software
• New architectures
• Standards

History of Database Management- 3 History of Database Management- 4


Relational Model and Systems Founded Modern DB Technology

• Simplicity of basic concepts: simple, abstract data structure, independent of physical


considerations, sufficiently powerful and general as a realistic framework for numerous
database applications in business data processing
Relational Market Today
• Simple basis for a theoretical framework that stimulated DB research
– relational theory based on well-established theories: mathematical set theory • Most users still buy relational, huge installed base
and first-order predicate logic
• Leading relational vendors: Oracle, Sybase, Informix, IBM, Microsoft
– the DB field turns into a scientific discipline, the pre-relational era had drawn
little interest from computer scientists • Products of similar functionality and performance
– many interesting questions were raised for researchers and the field attracted a • Emphasis on client/server architecture, open systems, interoperability
lot of good minds; progress was quick in the 70’s
• System performance, robustness have consistently strengthened (e.g., recovery,
– remark about theory in general: theoretical basis 6= lack of practicality
concurrency support, distribution)
∗ theory = reliability, predictability
∗ a strong formal underlying theory is always interesting • Numerous client tools (4GLs), users want forms, pictures, graphs, sounds, etc.
∗ users need not master nor know about the whole theory not tables of numbers.

• Emergence of the notion of data model, quickly leading to semantic data models • But the web is making 4GLs and business forms history ...
with more semantics than the relational model, and a clean a posteriori definition of
• PC systems, competing with the larger DBMSs for lower-end applications
network and hierarchic data models. Data models include the definition of integrity
at the application-domain level, as a part of database design.

• Data independence: insulation of users from physical DB design and access opti-
mization; all general trend in informatics
7
• Emergence of disciplined DB design: a more semantic or conceptual view of infor-
mation structures (e.g., E-R) on top of the relational schema significantly simplifies
analysis and design process
• Design of high-level interactive database languages, based on set-at-a-time, non-
procedural operations, high-level operations (a first in the DB world): demonstrate
that a less procedural style of data manipulation was not only practically feasible but
also highly productive
• Construction of efficient DBMS software, in particular efficient query optimizers,
transaction management for multiple users (concurrency control), reliability and fail- Relational Market ...
ure recovery
• SQL has been overemphasized, overadvertised, has become inevitable (“COBOL
• New architectures: database machines, client-server, distributed databases, client
tools, 4GLs of the 90’s”)

• SQL Standard = portability of applications • Widespread lack of education on DB fundamentals


• Some built-in economic incentives for complexity and confusion: market for
books, journals, conferences, education, consulting, ...
• SQL standardization has become an inhibitor to the development of better user
interfaces

History of Database Management- 5 History of Database Management- 6


• Economics

– reduction of hardware costs (memory, computing cycles, soon network bandwidth


are essentially free at the margin)
– 25 years ago, one hour of machine time ≈ one week of programmer time
Post-Relational Era
– today, one week of programmer time ≈ a personal computer
• New organizations of data – relational technology has provided huge overall productivity gains
– entity-relationship model • Evolution of DBMS software
– nonnormalized relations
– development of efficient DBMS functions (e.g., relational query optimizers)
– object technology
– object-relational systems – easier DB programming
– semi-structured and unstructured data (WWW) – powerful and user-friendly front-end software (traditional DB software is progres-
• New functions sively becoming middleware)
– distribution
– heterogeneity (multidatabases, interoperability)
– deduction From the web:
– data analysis (data mining)
• More complex applications: data-management technology enters application If the automobile had followed the same development cycle as the computer, a Rolls-Royce
domains (e.g., design, geography, molecular biology, electronic commerce) would today cost 100 USD, get one million miles to the gallon, and explode once a year,
killing everybody inside.

9
Weaknesses of Relational Technology

• Built-in data types (essentially alphanumeric), inadequate for applications with


complex data (e.g., design, WWW, multimedia, GIS, biology)
• Poor conceptual modeling (e.g., generalization, aggregation, materialization must
Evolution = Economics + Methodology be explicitly managed by user programs)
• Separation between data and operations (stored procedures not integrated in the
• Economics model, no encapsulation)
• RDBMS have not fully exploited the relational model (e.g., domains, integrity-
– name of the game 30 years ago: optimize the use of expensive hardware
contraint support)
– analogous evolution in all domains of informatics (e.g., from machine language • Transaction model inappropriate for long-duration transactions for interactive,
to high-level programming languages) cooperative design environments
– human efficiency has become the crucial quality factor • Application programming: mismatch between SQL and traditional programming
• Methodological advances languages → 4GLs, OODBMSs
• Performance of RDBMS insufficient for decision-support applications (⇒ OLAP
– development of software engineering (“software crisis”)
technology)
– object technology
– methods for software development (from structured methods to OO analysis
and design)
11
– continuous migration of program functions into DBMS functions

10

History of Database Management- 7 History of Database Management- 8


Information-Management Context
Contributions of OO Technology
• Relational technology “solved” business data processing
• Objects are more natural than relations for real-world modeling • Success of database technology ⇒ users demand it for domains other than busi-
• Application functions are less stable than data ness data processing

• Complex and application-dependent data types simplify application development • Steady improvement in price/performance ratio of hardware and supporting ar-
chitectures
• Object identity, for object chaining, sharing
• “Software crisis”:
• Data encapsulation (ADTs)
– quality of application development is becoming a major concern
– some behavior naturally goes with data
– maintenance dominates overall development costs ⇒ reuse
– stable public interface for objects
– application-development methodology is improving
– greater stability, easier evolution
• Open systems ⇒ variety of system architectures
• Generalization and inheritance, for reuse, easier adaptation
• Transition to objects has become inevitable
• Polymorphism, for simpler programming, easier evolution

13
12

Application Context

• Today’s applications: objects + web + multimedia


• More complex data
– time series (data warehouses, data mining ⇒ progress in decision-support
systems)
– multimedia
– weakly-structured data (WWW)
– complex data (e.g., molecular biology, geographic information systems)
• Internet, emergence of electronic commerce

14

History of Database Management- 9 History of Database Management- 10


• Static HTML and GIF files are no longer enough
• Need for dynamic, interactive applications, with audio and video streams

• Sophisticated, integrated multimedia asset management

• Possible convergence data-modeling and document-management technologies:

– originally, most web documents were manually composed


– they are increasingly generated automatically from databases
Problems with OODBMSs
– HTML is a document-markup language, not a format for document integration
and exchange
• Language-bound (closely tied to C++, Smalltalk, Java)
– XML is closer to “semi-structured” databases
• Rediscover problems of tying database design close to application design
• Rediscover that declarative (SQL-like) approach is productive
• Lack of robustness for major DBMS functions for update-intensive OLTP appli-
cations (security, transaction processing, parallel processing)

Object-Oriented Database-Management Systems


(OODBMSs)

• Extension of object-oriented programming languages (OPLs) to persistent data


management
16
• Tighter (seamless) integration between OPLs (C++, Smalltalk, Java) and DBMS
• Extensible type system for complex objects
• OODBMSs manage (store, call, query) complex objects directly
• Performance thru caching large numbers of objects in address space of client
program

OODBMS Market
15
• Gradually appeared in the early 90’s
• Vendors: Object Design, Objectivity, Versant, Computer Associates (Jasmine),
Ardent (O2 )
• Niche market (about 100 times smaller than relational market, 100 M vs 10 B)
• Market development hindered by existing relational base

17

History of Database Management- 11 History of Database Management- 12


Object-Relational DBMSs

• Goals
– synthesis of the best of RDBMSs and OODBMSs, i.e., flexibility of data
model and performance of DBMS functions OR DBMS Market
– saveguard RDBMS investments
• Differences with RDBMSs • Main relational vendors: Informix, Oracle, Sybase, IBM, Microsoft
– extensible type system (application-dependent new types and functions) • Roughly similar products
– OO inheritance and polymorphism • Big business!
– better integration with PLs than SQL-92
• Differences with OODBMSs
– separate declarative data sublanguage (instead of smooth integration between
data language and PL)

18 20

Recent Evolutions
Problems with ORDBMS
• RDBMS vendors have maintained their strong position
• Hybrid approach • RDBMS are evolving into object-relational systems
• Technological integration without new concepts • New RDBMS developments ensure downward compatibility
• Weak on inheritance and polymorphism • Microsoft has entered the mainstream DB server market
• No sound theoretical foundations • OODBMS vendors have not moved from niche markets and have not displaced
• Bound to compromising RDBMS

• New, unproven • Standards have had a mixed role


• Tool vendors have lost importance through extensions of main DBMS products

19 21

History of Database Management- 13 History of Database Management- 14


New Topics (Buzzwords)

• Object technology
• Conceptual modeling
• Enterprise resource planning (ERP)
• Data mining
• Data warehouses
• Knowledge management, competitive intelligence
• Interoperability, heterogeneity, integration, ontologies, multidatabases
• Object-relational systems
• Workflow management

22

History of Database Management- 15

Anda mungkin juga menyukai