Haku

Työnhakusivujen verkkoharavointi

QR-koodi

Työnhakusivujen verkkoharavointi

The topic of this thesis is web scraping job listing search results from job portals. Since the dawn of the internet, e-recruitment has been a popular method for information technology job listings. There are various requirements relating to programming languages. From online job listings, job seeker can determine what are the most sought-after programming skills. Therefore, the purpose of this thesis is to determine the most popular programming languages in the current labor market. In addition, the thesis focuses on the Finnish job market.

The theoretical background of the thesis covers e-recruitment, web scraping in general, and ethical web scraping in particular. The empirical approach is mostly constructive and functional. In the functional part, a Python script can do web scraping. The web scraped data wasgathered from two job portals, which agreed to participate in the study. The data gathered was saved, analyzed and visualized via Microsoft Excel.

In the thesis, a web scraping method for gathering job listings data was constructed. Based on the results, the five most popular programming languages from Duunitori web portal are Python (1), SQL (2), React (3), Javacript (4) and Java (5), in this order. In the Jobly job portal the five most popular programming languages are PHP (1), React (2), Java (3), SQL (4) and Python (5). Based on the combined data set, the five most popular programming languages are Python (1), SQL (2), React (3), Java (4) and Javascript (5).

Tallennettuna: