Electronic commerce grows more popular every day, and shoppers can choose from the offers of countless online stores. This large number of stores has created demand for various third-party services and solutions. Some of these services connect to the sellers' sites and provide additional functionality there; others operate independently of the stores and can be integrated by the sellers.
In Hungary, such services are provided by Árukereső and Árgép, among others. The business must supply its product data to them in a data file with a special format, and it must do so frequently to keep the data as fresh as possible; this is the sellers' responsibility. In this thesis, I examine how to automate this process and how to simplify the integration from the side of the online stores in different cases.
HTML metadata makes it possible to describe the structure of the data on web pages in a standard way; when every product is marked up with these structures, the products can be processed uniformly. Its use is becoming increasingly widespread on the web, as more search engines also use this data for displaying and formatting their results, so it may be well suited to solving this problem.
Using metadata can simplify the integration process significantly, so that recording and updating data becomes a well-automated process. In the study, I examine data extraction based on HTML metadata on several stores available online. I assess the applicability of the method and develop algorithms that can obtain the data of a given page, or even process the whole supply of a shop based on some common pattern.
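To make the idea of page-level extraction concrete, the following is a minimal sketch and not the algorithm developed in the thesis. It assumes that the store marks up its products with schema.org microdata (`itemprop` attributes); the text above speaks only of HTML metadata in general, so this format is an assumption. The names `ProductMicrodataParser`, `extract_product` and the sample page are hypothetical, and nested items are flattened into one dictionary for simplicity.

```python
from html.parser import HTMLParser


class ProductMicrodataParser(HTMLParser):
    """Collects itemprop values from one HTML page (simplified sketch).

    Assumption: the page uses schema.org microdata. Nested items are
    flattened, so properties with the same name overwrite each other.
    """

    def __init__(self):
        super().__init__()
        self.properties = {}
        self._current_prop = None  # property whose text value is awaited

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        prop = attr.get("itemprop")
        if prop is None or "itemscope" in attr:
            # Not a property, or a nested item whose children carry the values.
            return
        if "content" in attr:
            # Value stored in the attribute, e.g. <meta itemprop="price" content="4990">.
            self.properties[prop] = attr["content"]
        else:
            # Value is the element's text; it is captured in handle_data.
            self._current_prop = prop

    def handle_data(self, data):
        text = data.strip()
        if self._current_prop and text:
            self.properties[self._current_prop] = text
            self._current_prop = None


def extract_product(html):
    """Return the flattened microdata properties found in the given HTML."""
    parser = ProductMicrodataParser()
    parser.feed(html)
    parser.close()
    return parser.properties


# Hypothetical product page fragment, used only for illustration.
SAMPLE = """
<div itemscope itemtype="https://schema.org/Product">
  <h1 itemprop="name">Example Kettle</h1>
  <div itemprop="offers" itemscope itemtype="https://schema.org/Offer">
    <meta itemprop="priceCurrency" content="HUF">
    <span itemprop="price">4990</span>
  </div>
</div>
"""

product = extract_product(SAMPLE)
print(product)
```

A dictionary of this kind could then be written out in whatever data file format a price comparison service requires. Processing the whole supply of a shop would additionally need a way to discover the product pages, which this sketch does not cover.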