Information Retrieval in MongoDB Data storage, Indexing and Querying - - PowerPoint PPT Presentation

information retrieval in mongodb
SMART_READER_LITE
LIVE PREVIEW

Information Retrieval in MongoDB Data storage, Indexing and Querying - - PowerPoint PPT Presentation

Information Retrieval in MongoDB Data storage, Indexing and Querying Kaustubh Dhokte (NB97699) University Of Maryland Baltimore County CMSC 676: Information Retrieval Agenda Introduction to MongoDB MongoDB Architecture MongoDB Storage


slide-1
SLIDE 1

Information Retrieval in MongoDB

Data storage, Indexing and Querying

Kaustubh Dhokte (NB97699) University Of Maryland Baltimore County CMSC 676: Information Retrieval

slide-2
SLIDE 2

Agenda

´ Introduction to MongoDB ´ MongoDB Architecture ´ MongoDB Storage Architecture ´ Indexing in MongoDB ´ MongoDB Query Engine

slide-3
SLIDE 3

MongoDB

´ Database Engine ´ Open Source ´ Distributed ´ Full Fledged

´ Text Search ´ Aggregation ´ Cloud Based

slide-4
SLIDE 4

MongoDB Architecture

slide-5
SLIDE 5

MongoDB Architecture

´ RDBMS or Document Based? ´ SQL or NoSQL? ´ Dynamic ´ Horizontally Scalable

´ Sharding

´ Fault Tolerant

´ Replica Sets

´ Load Balanced

slide-6
SLIDE 6
slide-7
SLIDE 7

MongoDB Storage Architecture

slide-8
SLIDE 8

MongoDB Storage Architecture

slide-9
SLIDE 9

MongoDB Data Model

´ Databases ´ Collections ´ Indexes ´ BSON http://bsonspec.org/

MongoDB Query Language

´ Database drivers of popular languages

slide-10
SLIDE 10

MongoDB Storage Engines

´ WiredTiger (Default) ´ MMAPv1 (Deprecated in latest version of MongoDB) ´ In-memory

´ Non-persistent

´ Encrypted ´ 3rd Party Integrations

´ E.g. RocksDB

slide-11
SLIDE 11

MongoDB Indexing

slide-12
SLIDE 12

MongoDB Indexing

´ Index Data Structure: B-Tree ´ Primary Indexes: _id ´ Secondary (User defined indexes)

´ Single Field ´ Compound Index ´ Multikey ´ Text ´ Hashed

slide-13
SLIDE 13

MongoDB Indexes Properties

´ Unique ´ Partial ´ TTL indexes

slide-14
SLIDE 14

MongoDB Query Engine

slide-15
SLIDE 15

MongoDB Query Engine

slide-16
SLIDE 16

Query Planner

´ Inputs

´ Predicate ´ Projection ´ Sort

´ Outputs Blueprint of what happens in query execution

slide-17
SLIDE 17

Query Planner

´ Index Selection

´ IXSCAN ´ COLLECTIONSCAN ´ AND/OR

´ Access: Plan Generation ´ Analysis: Sort, Project

slide-18
SLIDE 18

Query Planner

´ Selecting Best Plan

´ One Work Unit:

´ One index key ´ One document

´ Results Produced per work unit executed

´ Plan Ranking ´ Plan Caching

slide-19
SLIDE 19

References

´ https://www.researchgate.net/figure/MongoDB-architecture- researchgate-2018_fig2_330841309 ´ https://www.mongodb.com/blog/post/building-applications-with- mongodbs-pluggable-storage-engines-part-1 ´ https://www.slideshare.net/NorbertoLeite/mongodb-internals-55965341 ´ https://docs.mongodb.com/manual/

slide-20
SLIDE 20

Thank You