Storage Engine

Database Storage Engine

Two kinds of database storage engines are the most popular:
  1. Log-Structured Storage Engine (SSTables, LSM-Tree)
  2. Page-Oriented Storage Engine (B-Tree)

Log-Structured Storage Engine

Most databases internally use a log, which is an append-only data file.
  • A log file is an append-only sequence of records
  • It is a binary file, not meant to be human readable
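The idea can be sketched in a few lines of Python. This is a minimal illustration, not a real engine; the file name and the comma-separated record format are assumptions made for the example.

```python
DB_FILE = "database.log"  # hypothetical file name for this sketch
open(DB_FILE, "w").close()  # start with an empty log for this example

def db_set(key, value):
    # Appending is the only write operation: old values are never overwritten.
    with open(DB_FILE, "a") as f:
        f.write(f"{key},{value}\n")

def db_get(key):
    # Naive read: scan the whole log and keep the last value seen for the key.
    result = None
    with open(DB_FILE) as f:
        for line in f:
            k, _, v = line.rstrip("\n").partition(",")
            if k == key:
                result = v
    return result

db_set("42", "san_francisco")
db_set("42", "cali")
print(db_get("42"))  # prints "cali": the latest appended value wins
```

Note that `db_get` scans the entire file on every call, which is exactly the performance problem described in the next section.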

Issues

Since the log file is append-only, it keeps growing, which causes performance issues.
  • While reading data, every lookup scans the file from the beginning
  • Performance degrades as the number of records in the database grows
To mitigate these issues, databases use a concept called indexing.

Index

  • In order to efficiently find the value for a particular key in the database, we need a different data structure: an index.
  • In general, the idea behind an index is to keep some additional metadata on the side, which acts as a signpost and helps you locate the data you want.
  • If you want to search the same data in several different ways, you may need several different indexes on different parts of the data.
  • An index is an additional structure derived from the primary data. Many databases allow you to add and remove indexes without affecting the contents of the database; it only affects the performance of queries.

Index Trade-Off

    • Any kind of index usually slows down writes, because the index also needs to be updated every time data is written.
    • This is an important trade-off in storage systems: well-chosen indexes speed up read queries, but every index slows down writes.

Hash Index

  • An index for key-value data.
  • The simplest possible indexing strategy:
    • Keep an in-memory hash map where every key is mapped to a byte offset in the data file, the location at which the value can be found.
    • Whenever you append a new key-value pair to the data file, you also update the hash map to record the offset of the data you just wrote.
    • When you want to look up a value, use the hash map to find the offset in the data file, seek to that location, and read the value.
  • This index is suitable when the number of keys is small enough that it is feasible to keep all the keys in RAM.
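The three steps above can be sketched as follows, keeping the one-record-per-line format from earlier. `HashIndexStore` and the segment file name are illustrative, not a real API.

```python
class HashIndexStore:
    """Sketch: in-memory hash map of key -> byte offset into an append-only data file."""

    def __init__(self, path):
        self.path = path
        self.index = {}          # key -> byte offset of the value's record
        open(path, "w").close()  # start with an empty data file for this example

    def put(self, key, value):
        with open(self.path, "ab") as f:
            offset = f.tell()                 # byte offset where this record starts
            f.write(f"{key},{value}\n".encode())
        self.index[key] = offset              # update the hash map on every write

    def get(self, key):
        offset = self.index.get(key)
        if offset is None:
            return None
        with open(self.path, "rb") as f:
            f.seek(offset)                    # jump straight to the record, no scan
            line = f.readline().decode().rstrip("\n")
        _, _, value = line.partition(",")
        return value

store = HashIndexStore("segment0.data")  # hypothetical segment file name
store.put("user:1", "alice")
store.put("user:1", "bob")   # append-only: the index now points at the newer record
print(store.get("user:1"))   # prints "bob", reading exactly one record
```

Reads cost one seek plus one record read instead of a full file scan, at the price of keeping every key in memory.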

Data File:

Since writing to the data file only ever appends, we need to avoid running out of disk space.
  • Break the data file into segments of a certain size. Once a segment file reaches that size, close it and write subsequent data to a new segment file.
  • Then we can perform compaction on the segments.
    • Compaction means throwing away duplicate keys in the log and keeping only the most recent update for each key.

  • Data file segment: (figure not included)

  • Compacted segment: (figure not included)

  • Moreover, since compaction often makes segments much smaller, we can also merge several segments together at the same time as performing the compaction.
  • Segments (data files) are never modified after they have been written, so the merged segment is written to a new file.
  • The merging and compaction of frozen segments can be done in a background thread; while it is going on, we can continue to serve read and write requests as normal, using the old segment files.
  • After the merging process is complete, we switch read requests to the new merged segment. The old segment files can then be deleted.
  • Each segment now has its own in-memory hash table, mapping keys to file offsets.
  • To find the value for a key, we first check the most recent segment's hash map; if the key is not present, we check the second-most-recent segment, and so on.
  • The merging process keeps the number of segments small, so lookups do not need to check many hash maps.
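The merge-plus-compaction step can be sketched as below. For simplicity each frozen segment is represented as an in-memory list of (key, value) records in append order, oldest segment first; a real engine would stream records from segment files instead.

```python
def compact_and_merge(segments):
    """Merge several frozen segments into one compacted result (sketch).

    `segments` is ordered oldest -> newest; each segment is a list of
    (key, value) records in append order.
    """
    merged = {}
    for segment in segments:          # walk segments oldest first...
        for key, value in segment:    # ...so later records overwrite earlier ones
            merged[key] = value       # keeping only the most recent update = compaction
    return merged

old_segment = [("mew", 1078), ("purr", 2103), ("purr", 2104)]
new_segment = [("mew", 1079), ("scratch", 252)]
print(compact_and_merge([old_segment, new_segment]))
# one entry per key, each holding the most recent update
```

Because the inputs are frozen, this can run in a background thread while reads and writes continue against the old segments.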

  • Deleting a record
    • If you want to delete a key and its associated value, you append a special deletion record to the data file, called a tombstone.
    • When log segments are merged, the tombstone tells the merging process to discard any previous values for the deleted key.
  • Crash recovery
    • If the database is restarted after a crash, the in-memory hash maps are lost.
    • They can be rebuilt by rescanning the segments, but a faster solution is to store a snapshot of each segment's hash map on disk, which can be loaded into memory quickly.
  • Partially written records
    • The database may crash at any time, including halfway through appending a record to the log.
    • Some database engines include checksums, allowing such corrupt parts of the log to be detected and ignored.
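Tombstone handling fits naturally into the merge sketch from earlier. The `TOMBSTONE` marker value is an assumption of this example; real engines use a flag in the record format instead of a magic value.

```python
TOMBSTONE = object()  # hypothetical sentinel marking "this key was deleted"

def delete(segment, key):
    # Deletion is just another append: a special tombstone record.
    segment.append((key, TOMBSTONE))

def merge_with_tombstones(segments):
    # Same merge as before, but a surviving tombstone means the key
    # (and all its previous values) is discarded from the output.
    merged = {}
    for segment in segments:               # oldest -> newest
        for key, value in segment:
            merged[key] = value
    return {k: v for k, v in merged.items() if v is not TOMBSTONE}

seg = [("a", "1"), ("b", "2")]
delete(seg, "a")
print(merge_with_tombstones([seg]))  # prints {'b': '2'}: 'a' is gone
```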

Limitations of the Hash Table Index

  • The hash table must fit in memory
  • If you have a very large number of keys, you are out of luck
  • Range queries are not efficient.
    • Ex: you cannot scan over all keys between mter0000 and mter9999; you would have to look up each key individually in the hash map.
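To see why range queries are inefficient, here is a sketch using a plain dict as the index: a hash map keeps no key ordering, so the scan degenerates into one probe per possible key in the range, whether or not that key exists.

```python
# Hypothetical index: every 7th key in the mter0000..mter9999 key space,
# mapped to made-up file offsets.
index = {f"mter{i:04d}": i * 100 for i in range(0, 10000, 7)}

def range_query(index, start, end):
    # With no key ordering, the only option is to probe every candidate
    # key in the range individually; 10,000 hash lookups for this scan.
    results = []
    for i in range(10000):
        key = f"mter{i:04d}"
        if start <= key <= end and key in index:
            results.append((key, index[key]))
    return results

print(len(range_query(index, "mter0000", "mter9999")))
```

A sorted structure would instead seek to `mter0000` once and read keys sequentially until `mter9999`, which is the motivation for SSTables.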
What is next: SSTables and LSM-Trees, which address the limitations of the hash index.
