Data-driven decision-support systems have become part of everyday operations at large companies. Managers no longer keep up to date by checking the balance sheet and income statement for the past financial year; instead, they analyse more informative reports that are refreshed daily. By understanding and analysing these reports, management can make more informed and better-optimized decisions. This process is called business intelligence: decision-makers are able to perceive the business situation and, if necessary, to change the direction in which the organization is heading.
For business intelligence systems to be effective, a data warehouse is required. The data warehouse is essentially the technical implementation of the backend, which makes the required data available. It integrates different source systems in a subject-oriented and time-variant way, which makes it possible to generate query-based reports.
In this project I design and develop a data warehouse that meets the above-mentioned criteria. In the first part of the project I go through the design stages, from understanding the data source through specifying the reports to designing the data structure. In the second part I implement the data warehouse with Microsoft tools, namely Microsoft SQL Server and its various BI services. The data I use for building the data warehouse comes from BKK (Budapesti Közlekedési Központ), the public transport company of the Hungarian capital. BKK developed a traffic management and information system that serves real-time information about routes, schedules, stops and alerts. The other data source used throughout this project is the BKK GTFS (General Transit Feed Specification) data, which provides offline (non-real-time) information about schedules, stops and routes.