Implementing a document database system

OData support
Supervisor:
Kövesdán Gábor
Department of Automation and Applied Informatics

The purpose of this thesis is the planning of a document archiving system, that has different database layers, which will be compared. Firms may need to store documents, which contains standards, contracts, images, texts, all this within a well structured system. This intent brings up the question of versioning, because most of the standard evolve over time. In this case, the older versions are needed too, they must be reachable in special cases.

The strict structure of relational databases does not allow us to store huge amount of plain data, files for example, inside the database, to avoid this, the files themselves are stored on other parts of the file system, and only references are stored in the specified records.

In comparison to this, the so called NoSQL databases' loose structure does not deny it in any matter. These kinds of databases are becoming widely spread in the informatics industry, which use a different data model to the relation kind (i.e. :graph, key-value pairs, documents), that supports more the handling of connections between data.

The goal here is to examine the chosen NoSQL database management system and the storage of the files. On the other hand it is necessary to check the advised methods of storing files in relational databases too. For the examination of the different databases, it is needed to develop a data intensive application, to demonstrate the two kinds of working, what are the advantages and disadvantages of each.

To develop this application, I chose the Java programing language, the legacy softwares which are used in corporations show a real success with this language and environment. Wide range of technological support are provided for numerous databases.

For the ease of use the application includes a browser displayed surface, from where the documents can be manipulated.

After the two implementation with different databases are finished, the ending of this thesis will be to document the differences and heavy tests of file handling.

Downloads

Please sign in to download the files of this thesis.