Multimedia Database Management Systems

Author: Guojun Lu

Hardcover 373 pp.

Published in October 1999, by Artech House Publishers

ISBN: 0-89006-342-7

Traditional database management systems can’t handle the demands of managing multimedia data. With the rapid growth of multimedia platforms and the world wide web, database management systems must now process, store, index, and retrieve alphanumeric data, bitmapped and vector-based graphics, and video and audio clips both compressed and uncompressed. The comprehensive, systematic approach of Multimedia Database Management Systems presents you with current and emerging methods for managing the increasing demands of multimedia databases and their inherent design and architecture issues.

With this comprehensive resource, you learn how to create an effective multimedia database by integrating the various information indexing and retrieval methods currently available. Also, you learn to measure multimedia database performance that is based on similarity to queries and routinely affected by human judgement.

This book concludes with a discussion of networking and operating system support for multimedia databases and a look at current research and development in this dynamic field. It is a vital tool for database developers and managers, multimedia researchers, as well as web developers.

Outline:

  1. Introduction
  2. Multimedia Data types and Formats
  3. Multimedia Database Design Issues
  4. Indexing and Retrieval of Text Documents
  5. Indexing and Retrieval of Audio
  6. Image Indexing and Retrieval
  7. Video Indexing and Retrieval
  8. Integrated Multimedia Indexing and Retrieval
  9. Techniques and Data Structures for Efficient Multimedia Similarity Search
  10. System Support for Distributed Multimedia Databases
  11. Measurement of Multimedia Information Retrieval Effectiveness
  12. Products, Applications and New Developments.

Detailed Table of Contents

Preface xvii

Chapter 1 Introduction 1
1.1 Some Important Definitions 1
    1.1.1 Media Types and Multimedia 1
    1.1.2 Databases and Database Management Systems 2
    1.1.3 Text Document Information Retrieval 2
    1.1.4 Multimedia Indexing and Retrieval 3
    1.1.5 Feature Extraction, Content Representation and Indexing 3
1.2 Need for MIRS 3
    1.2.1 Proliferation of Multimedia Data and Their Characteristics 3
    1.2.2 DBMSs and Their Role in Handling Multimedia Data 4
    1.2.3 Information Retrieval Systems and Their Role in Multimedia
            Retrieval 7
    1.2.4 Integrated Approach to Multimedia Information Indexing and
            Retrieval 7
1.3 An Overview of the MIRS 7
1.4 Expected Capabilities and Common Applications of MIRS 8
1.5 Organization of the Subsequent Chapters 10
Problems 11
REFERENCES 11

Chapter 2 Multimedia Data Types and Formats 13
2.1 Introduction 13
2.2 Text 14
    2.2.1 Plain Text 14
    2.2.2 Structured Text 15
    2.2.3 Text Compression 15
            Huffman Coding 15
            Run-Length Coding 16
            Lempel-Ziv-Welch (LZW) Coding 17
2.3 Vector Graphics and Animation 17
2.4 Audio 18
    2.4.1 Basic Characteristics of Audio Signal 18
    2.4.2 Digital Representation of Audio 18
            Sampling 19
            Quantization 20
            Coding 20
            Determination of Sampling Rate 21
            Determining the Number of Quantization Levels 22
    2.4.3 Musical Instrument Digital Interface (MIDI) 24
    2.4.4 Audio Compression 24
            Nonlinear Quantization 24
            Predictive Coding 25
            Compression Technique Using Masking Property: MPEG-Audio 27
2.5 Digital Images 28
      2.5.1 Digital Image Representation 28
               Representation of Grey Scale Images 28
               Representation of Color Images 29
       2.5.2 Main Parameters of Digital Images 29
       2.5.3 Image Compression 29
                Spatial Subsampling 30
                Predictive Coding 30
                Transform Coding 31
                Vector Quantization 31
                Fractal Image Coding 32
                Wavelet Compression 33
                Practical Coding Systems 33
                The JPEG-Still Image Compression Standard 33
                JBIG 34
                JPEG-2000 36
2.6 Digital Video 36
    2.6.1 Digital Video Representation 36
    2.6.2 Video Compression 37
        2.6.2.1 Motion Estimation and Compensation 37
        2.6.2.2 MPEG 38
                    MPEG-1 39
                    MPEG-2 40
                    MPEG-4 41
                    MPEG-7 42
                    Other Standards 43
2.7 Standards for Composite Multimedia Documents 43
2.8 Major Characteristics and Requirements of Multimedia Data and
        Applications 44
    2.8.1 Storage and Bandwidth Requirements 44
    2.8.2 Semantic Structure of Multimedia Information 46
    2.8.3 Delay and Delay Jitter Requirements 46
    2.8.4 Temporal and Spatial Relationships Among Related Media 47
    2.8.5 Subjectiveness and Fuzziness of the Meaning of Multimedia Data 47
2.9 Summary 47
Problems 48
Further Reading 50
References 50


Chapter 3 Multimedia Database Design Issues 53
3.1 Introduction 53
3.2 MIRS Architecture 54
3.3 Data Models 55
    3.3.1 Data Model Requirements 55
    3.3.2 A General Multimedia Data Model 56
            Object Layer 56
            Media Type Layer 57
            Media Format Layer 57
            Remaining Issues 57
    3.3.3 Example Data Models 58
            VIMSYS Data Model 58
            A General Video Model 59
            Virage Image Schema Structure 60
3.4 User Interface Design 61
    3.4.1 Database Population 61
    3.4.2 Query Support 62
            Search 62
            Browsing 62
            Query refinement 63
    3.4.3 Result Presentation 63

3.5 Feature Extraction, Indexing and Similarity Measure 64
      3.5.1 Feature Extraction 64
      3.5.2 Indexing Structure 66
      3.5.3 Similarity Measurement 66
3.6 Quality of Service Guarantees in Clients, Servers and Communication
        Systems 66
3.7 Other Issues 67
    3.7.1 Multimedia Data Compression 67
    3.7.2 Data Representation Standardization 68
    3.7.3 Query Processing and Retrieval 69
3.8 Summary 69
Problems 69
Further Readings 71
References 71

Chapter 4 Text Document Indexing and Retrieval 73
4.1 Introduction 73
4.2 Differences Between IR Systems and DBMS 74
4.3 Automatic Text Document Indexing and Boolean Retrieval Model 75
    4.3.1 Basic Boolean Retrieval Model 75
    4.3.2 File Structure 76
            Inverted Files 76
            Extensions of the Inverted File Operation 77
    4.3.3 Term Operations and Automatic Indexing 78
    4.3.4 Summary of Automatic Document Indexing 81
4.4 Vector Space Retrieval Model 81
    4.4.1 Basic Vector Space Retrieval Model 81
    4.4.2 Relevance Feedback Techniques 82
            Query Modification 83
            Document Modification 83
4.5 Probabilistic Retrieval Model 84
4.6 Cluster-based Retrieval Model 84
    4.6.1 Cluster Generation 85
    4.6.2 Cluster-Based Retrieval 85
4.7 Non-traditional Information Retrieval Methods 85
4.8 Performance Measurement 86
4.9 Performance Comparison among Different IR Techniques 89
4.10 WWW Search Engines 89
    4.10.1 A Brief Introduction to the WWW 89
    4.10.2 Resource Discovery 92
    4.10.3 Major Differences Between IR Systems and WWW Search
                Engines 93
         4.10.3.1 WWW Documents are Distributed 94
         4.10.3.2 The Number of WWW Documents Is large 94
        4.10.3.3 WWW Documents are Dynamic and Heterogeneous 95
        4.10.3.4 WWW Documents Are Structured 95
        4.10.3.5 WWW Search Engines Are Heavily Used 96
        4.10.3.6 Other Issues 96
    4.10.4 General Structure of Www Search Engines 96
    4.10.5 An Example Search Engine 97
        4.10.5.1 Architecture Overview of Google 97
        4.10.5.2 Web Crawling 99
        4.10.5.3 PageRanks and Anchor Text 99
        4.10.5.4 Searching 100
4.11 Summary 100
Problems 100
Further Reading 103
References 103

Chapter 5 Indexing and Retrieval of Audio 105
5.1 Introduction 105
5.2 Main Audio Properties and Features 106
    5.2.1 Features Derived in the Time Domain 106
    5.2.2 Features Derived from the Frequency Domain 108
            Sound Spectrum 108
            Bandwidth 110
            Energy Distribution 110
            Harmonicity 110
            Pitch 111
    5.2.3 Spectrogram 111
    5.2.4 Subjective Features 112
5.3 Audio Classification 112
    5.3.1 Main Characteristics of Different Types of Sound 113
    5.3.2 Audio Classification Frameworks 113
5.4 Speech Recognition and Retrieval 116
    5.4.1 Speech Recognition 116
        5.4.1.1 Basic Concepts of ASR 116
        5.4.1.2 Techniques Based on Dynamic Time Warping 118
        5.4.1.3 Techniques Based on Hidden Markov Models 119
        5.4.1.4 Techniques Based on Artificial Neural Networks 121
        5.4.1.5 Speech Recognition Performance 121
    5.4.2 Speaker Identification 122
    5.4.3 Summary 122
5.5 Music Indexing and Retrieval 122
    5.5.1 Indexing and Retrieval of Structured Music and Sound Effects 122
    5.5.2 Indexing and Retrieval of Sample-Based Music 123
        Music retrieval based on a set of features 123
        Music retrieval based on pitch 124
5.6 Multimedia Information Indexing and Retrieval using Relationships
    Between Audio and Other Media 125
5.7 Summary 126
Problems 126
References 128

Chapter 6 Image Indexing and Retrieval 131
6.1 Introduction 131
6.2 Different Approaches to Image Indexing and Retrieval 132
6.3 Text-based Image Retrieval 132
6.4 Color Based Image Indexing And Retrieval Techniques 133
    6.4.1 The Basic Color Based Image Retrieval Technique 133
    6.4.2 Improvements to the Basic Technique 134
        6.4.2.1 Making Use of Similarity Among Colors 135
        6.4.2.2 Making Use of Spatial Relationships Among Pixels 137
        6.4.2.3 Making Use of the Statistics of Color Distribution 138
        6.4.2.4 Better Color Representation 138
            Different Color Spaces 138
            Variations of Image Representation 139
            Effects of Different Image Representations on Retrieval
                Performance 139
6.5 Image Retrieval Based on Shape 142
    6.5.1 Definitions of Common Terms and Some Simple Shape
        Measurements 143
    6.5.2 Invariant Moments 143
    6.5.3 Fourier Descriptors Method 145
    6.5.4 Histogram of Significant Edges 146
    6.5.5 Ordered List of Interest Points 146
    6.5.6 Elastic Template Matching 147
    6.5.7 Region-Based Shape Representation and Similarity Measure 147
        6.5.7.1 Basic Idea of Region-Based Shape Representation 148
        6.5.7.2 Rotation Normalisation 148
        6.5.7.3 Scale Normalisation 149
        6.5.7.4 Unique Shape Representation - Shape Index 149
        6.5.7.5 Similarity Measure 150
        6.5.7.6 Other Shape Operations 151
        6.5.7.7 Handling Multiple Major Axes 152
        6.5.7.8 Summary of Index and Retrieval Processes 152
        6.5.7.9 Retrieval Performance 153
6.6 Image Retrieval Based on Texture 155
6.7 Image Indexing And Retrieval Based On Compressed Image Data 156
    6.7.1 Image Indexing and Retrieval Based on DCT Coefficients 156
    6.7.2 Image Indexing and Retrieval Based on Wavelet Coefficients 157
    6.7.3 Image Indexing and Retrieval Based on VQ Compressed Data 157
6.8 Other Image Indexing and Retrieval Techniques 159
    6.8.1 Image Retrieval Based on Model-Based Compression 159
    6.8.2 Image Retrieval Based on Spatial Relationship 159
6.9 Integrated Image Indexing And Retrieval Techniques 159
    6.9.1 Query By Image Content (QBIC) 160
    6.9.2 Virage Image Search Engine 160
    6.9.3 WebSEEK 160
    6.9.4 ImageRover WWW Search Engine 161
Problems 161
Appendix A Color Representations 163
    A.1 Color Properties 164
    A.2 Color Specification Systems 164
        Device-Independent Color Specification 165
        Relationships Between CIE XYZ and Other Color Spaces 167
        Uniform Color Spaces 169
    A.3 Different Color Representations 170
        Gamma Correction 170
        Different RGB Systems or Color Spaces 173
References 175

Chapter 7 Video Indexing and Retrieval 179
7.1 Introduction 179
7.2 Overview of Shot-based Video Indexing and Retrieval 180
7.3 Video Shot Detection or Segmentation 181
    7.3.1 Basic Video Segment Techniques 181
    7.3.2 Detecting Shot Boundaries with Gradual Change 182
    7.3.3 Preventing False Shot Detection 183
    7.3.4 Other Shot Detection Techniques 184
    7.3.5 Segmentation of Compressed Video 185
        Based on MPEG Compressed Video 185
        Based on VQ Compressed Video 186
7.4. Video Indexing and Retrieval 186
    7.4.1 Indexing and Retrieval Based on R-Frames of Video Shots 186
    7.4.2 Indexing and Retrieval Based on Motion Information 188
    7.4.3 Indexing and Retrieval Based on Objects 189
    7.4.4 Indexing and Retrieval Based on Meta Data 190
    7.4.5 Indexing and Retrieval Based on Annotation 190
    7.4.6 Integrated Approach to Video Indexing and Retrieval 190
7.5 Effective Video Representation and Abstraction 191
    7.5.1 Topical or Subject Classification 191
    7.5.2 Motion Icon or Video Icon 192
    7.5.3 Video Streamer 193
    7.5.4 Clipmap 194
    7.5.5 Hierarchical Video Browser 194
    7.5.6 Storyboard 195
    7.5.7 Mosaicing 195
    7.5.8 Scene Transition Graph 195
    7.5.9 Video Skimming 196
7.6 Summary 196
Problems 196
References 198

Chapter 8 Integrated Multimedia Indexing and Retrieval 201
8.1 Introduction 201
8.2 Integrated Indexing and Retrieval Techniques 202
    8.2.1 Integrated Audio Indexing and Retrieval 203
    8.2.2 Integrated Image Indexing and Retrieval 204
    8.2.3 Integrated Video Indexing and Retrieval 204
    8.2.4 Merging of Results Obtained Based on Individual Features 204
    8.2.5 Media Translation 205
8.3 A General Architecture of Multimedia Information Management 205
8.4 User Interface 208
    Multimedia authoring and annotation 208
    Search and browsing 208
    Result presentation and relevance feedback 209
8.5 Example Systems 209
    8.5.1 QBIC 210
    8.5.2 An Integrated WWW Image Search Engine Developed at
            Monash University 211
        Text-based image indexing and retrieval 212
        Color-based image indexing and retrieval 213
        Image retrieval combining text- and colour-based methods 213
    8.5.3 Meta-Search Engines 214
        The query interface 215
        The query dispatching component 216
        The result merger 217
8.6 Summary 218
Problems 218
References 219

Chapter 9 Techniques and Data Structures for Efficient Multimedia
    Similarity Search 223
9.1 Introduction 223
9.2 Filtering Processes for Reducing Search Space 224
    9.2.1 Filtering with Classification, Structured Attributes and Keywords 225
    9.2.2 Methods Based on the Triangle Inequality 225
    9.2.3 Methods Specific to Color Histogram Based Retrieval 227
    9.2.4 Latent Semantic Indexing for Vector Space Based IR 228
9.3 B+- and B-trees 229
    9.3.1 B+-Trees 229
    9.3.2 B-Trees 233
9.4 Clustering 233
9.5 Multidimensional B+-tree 235
    9.5.1 Overview of MB+-Trees in Two Dimension Space 235
    9.5.2 Building a 2-d MB+-tree 237
    9.5.3 Search in MB+-trees 238
        Point query 238
        Range query 238
        k Nearest-neighbour query 238
    9.5.4 Higher Dimensional MB+-trees 239
9.6 k-d Trees 239
9.7 Grid Files 241
9.8 R-Tree Family 242
    9.8.1 An Overview of the R-Tree Structure 242
    9.8.2 Search, Insertion and Deletion of Region Objects 244
    9.8.3 Search, Insertion and Deletion of Point Data 244
    9.8.4 Search Efficiency in the R-Tree 245
    9.8.5 R*-tree, R+-tree and VAMSplit R-tree 245
    9.8.6 SS-Tree and SS+-Tree 246
9.9 The TV-tree 247
9.10 Summary 247
Problems 248
References 250

Chapter 10 System Support for Distributed Multimedia Databases 253
10.1 Introduction 253
10.2 Qos Management 256
    10.2.1 Definition 256
    10.2.2 General QOS Framework 257
    10.2.3 QOS Specification 258
    10.2.4 Admission Control, QOS Negotiation and Renegotiation 258
    10.2.5 Different Levels of Guarantee 259
    10.2.6 Providing QOS Guarantees 259
    10.2.7 An Example of QOS Handling 260
10.3 Design Goals of Multimedia Systems 260
10.4 Multimedia Data Storage Devices and Management 261
    10.4.1 Multimedia Storage Server Requirements 262
    10.4.2 Storage Devices 263
        10.4.2.1 Storage Capacity and Transfer Bandwidth Requirements 263
        10.4.2.2 Comparison of Different Types of Storage Devices 263
        10.4.2.3 Disk Arrays and RAID 264
            First-Level RAID: Mirrored Disks 264
            Second-Level RAID: Hamming Code for Error Correction 264
            Third-Level RAID: Single Check Disk Per Group 265
            Fourth-Level RAID: Independent Reads 265
            Fifth-Level RAID: No Dedicated Check Disk 266
        10.4.2.4 Storage Hierarchies 266
     10.4.3 Data Placement On Disks 267
        10.4.3.1 Contiguous Data Placement 267
        10.4.3.2 Scattered Data Placement 268
        10.4.3.3 Data Placement in Disk Arrays 269
    10.4.4 Disk Scheduling And Admission Control 269
        10.4.4.1 Traditional Disk-Scheduling Algorithms 270
        10.4.4.2 Earliest Deadline First 270
        10.4.4.3 Scan-Earliest Deadline First 271
        10.4.4.4 Round-Robin 271
        10.4.4.5 Group Sweeping Scheduling 272
    10.4.5 Provision Of User Interaction 273
        10.4.5.1 Pause and Resume 273
        10.4.5.2 Fast Forward and Backward 275
        10.4.5.3 QOS Issue Related to User Interactions 275
    10.4.6 Server Configuration And Network Connection 276
    10.4.7 Summary 277
10.5 Multimedia Computer Architectures 277
    10.5.1 Processor Architectures for Multimedia 278
        10.5.1.1 SIMD and VLIW 278
        10.5.1.2 Dedicated Multimedia Processors 279
        10.5.1.3 General-Purpose Processors 279
    10.5.2 Multimedia Computer Architectures 280
        10.5.2.1 The Basic Architecture 280
        10.5.2.2 The Use of Local Buses 281
        10.5.2.3 The Use of Dedicated Multimedia Devices 282
        10.5.2.4 Network-Based Multimedia Computer Architecture 283
10.6 Multimedia Operating Systems 284
    10.6.1 Multimedia Operating System Requirements 284
    10.6.2 Design Issues of Multimedia Operating Systems 285
    10.6.3 Conventional Time-sharing Operating Systems and
                Incorporation of Real-time Features 285
     10.6.4 Solutions To Data-copying Problem 287
            10.6.4.1 Data-Copying Problem 287
            10.6.4.2 Two Data Movement Methods 288
            10.6.4.3 Single-Copy and Zero-Copy Architecture 288
     10.6.5 Solutions To Reduce Context And Domain Switch Overhead 289
     10.6.6 QOS Support 290
        10.6.6.1 QOS Specification 290
        10.6.6.2 Admission Control 290
        10.6.6.3 Resource Reservation and Policing 291
        10.6.6.4 Process Scheduling Disciplines 291
            Rate Monotonic 292
            Earliest Deadline First 292
            Time-Driven Resource Management 293
        10.6.6.5 QOS Graceful Degradation and Media Scaling 293
10.7 Multimedia Networks 294
    10.7.1 Network Characteristics Suitable For Multimedia Communications 295
        10.7.1.1 Network Speed or Bandwidth 295
        10.7.1.2 Efficient Sharing of Network Resources 296
        10.7.1.3 Performance Guarantees 297
        10.7.1.4 Network Scalability 298
        10.7.1.5 Networks Suitable for Multimedia Communications 298
    10.7.2 Asynchronous Transfer Mode 300
        10.7.2.1 What Is ATM? 300
        10.7.2.2 B-ISDN Protocol Reference Model 302
        10.7.2.3 Why Is ATM Suitable for Multimedia Communications? 304
        10.7.3 Network Performance Guarantees 305
10.8 Multimedia Transport Protocols 306
    10.8.1 Requirements Of Multimedia Transport Protocols 306
        10.8.1.1 High Throughput 307
        10.8.1.2 QOS Specification and Guarantee 307
    10.8.2 Why Traditional Transport Protocols Are Not Suitable
            For Multimedia Communications 307
        10.8.2.1 Data Copying 308
        10.8.2.2 Flow Control 308
        10.8.2.3 Error Control 309
        10.8.2.4 Positioning and Handling of Control Information 309
        10.8.2.5 Lack of QOS Support 310
        10.8.2.6 Suitability of eXpress Transport Protocol 310
    10.8.3 Resource Reservation Protocols 310
        10.8.3.1 ST-II 311
            ST-II Connection Setup 311
            Flow Specification 312
        10.8.3.2 RSVP 313
        10.8.3.3 Comparison of ST-II and RSVP 314
    10.8.4 Real-time Transport Protocol (RTP) 315
    10.8.5 Other Multimedia Transport Protocols: HeiTP and Tenet 316
10.9 Achieving Overall Synchronous Presentation 317
    10.9.1 Synchronization Specification 318
        10.9.1.1 Scripts 318
        10.9.1.2 Time-Line-Based Temporal Specification 319
        10.9.1.3 Petri Nets 320
    10.9.2 Analysis of Causes of Losing Multimedia Synchronization 320
    10.9.3 Mechanisms To Achieve Multimedia Synchronization 322
        10.9.3.1 Measures To Counter Network Delay Variations 322
                Corrective Measures 322
                Preventive Measures: Network and Transport Support
                    for Continuous Media 324
        10.9.3.2 Measures To Counter Media-Specific Processing Skew 324
        10.9.3.3 Measures To Counter Clock Rate Difference 324
        10.9.3.4 Measures To Counter Packet-Out-of-Order Problems 325
        10.9.3.5 Measures To Coordinate Multiple Sources for
            Synchronous Transmission 325
        10.9.3.6 Workstation and Server Support for Continuous Media 326
        10.9.3.7 Measures To Provide User Interaction Synchronization 326
        10.9.3.8 Playout Techniques To Achieve Optimal Presentation
            Effects 326
    10.9.4 An Ultimate Solution Based On QOS Framework 327
Problems 327
References 330

Chapter 11 Measurement of Multimedia Information Retrieval Effectiveness 335
11.1 Introduction 335
11.2 Collection Of Human Judgement Data 336
    Method 1 336
    Method 2 336
    Method 3 337
    Selection of data collection methods 337
    Need for standard test databases 337
11.3 Recall And Precision Pair (RPP) 338
11.4 Percentage Of Weighted Hits (PWH) 338
11.5 Percentage Of Similarity Rankings (PSR) 339
11.6 Suitability Of The Common Effectiveness Measurements 339
    11.6.1 A Numerical Example 340
    11.6.2 Strengths and Weaknesses of PSR, PWH and RPP 342
11.7 Modified RPP 343
11.8 Factors Affecting Retrieval Effectiveness 343
11.9 Summary 344
Problems 344
References 345

Chapter 12 Products, Applications and New Developments 347
12.1 Introduction 347
12.2 Products Of Multimedia Database Management Systems 348
12.3 Applications Of Multimedia Indexing And Retrieval 348
    12.3.1 WWW Multimedia Search Engines 349
    12.3.2 Digital Libraries 349
    12.3.3 Video-on-demand Systems 350
        12.3.3.1 Networking Support for VOD 351
            Asymmetric Digital Subscriber Line 351
            Hybrid Fiber Coax 352
            High-Speed Digital Subscriber Line 353
            Direct Optical Fiber Connection 353
        12.3.3.2 Set-Top-Boxes 353
        12.3.3.3 Future Developments 354
12.4 Multimedia Security 354
    12.4.1 Providing Privacy and Confidentiality 355
    12.4.2 Authentication Verification 356
    12.4.3 Copyright Protection 357
12.5 MPEG-7 357
Problems 359
References 359

List of Acronyms 361

About the Author 367

Index 369