12th International Conference on Computer and Knowledge Engineering
IranITJobs2021: a Dataset for Analyzing Iranian Online IT Job Advertisements Collected Using a New Crowdsourcing Process
Authors:
Fakhroddin Noorbehbahani (1)
Nikta Akbarpour (2)
Mohammad Reza Saeidi (3)
1- University of Isfahan
2- University of Isfahan
3- University of Isfahan
Keywords:
Dataset collection, Crowdsourcing, Data analytics, Job posting
Abstract:
Gathering and preparing high-quality data is one of the most significant and expensive steps in data analytics. Crowdsourcing is an efficient way to create datasets for machine learning and data science applications. However, a proper crowdsourcing process is vital to ensure the quality of the collected data. To the best of our knowledge, no crowdsourcing process has been designed specifically for dataset collection. In this paper, a new crowdsourcing-based process for creating high-quality datasets is proposed, consisting of pre-gathering, gathering, and post-gathering phases. Today, employers and job seekers rely on online job postings and social media sites for recruitment more than ever before. Consequently, a huge volume of job posting data is available, which reinforces the need for data visualization and data analytics to extract valuable insights and support better decision making. Although several online job advertisement datasets exist for analyzing job demand and requirements, there is no such dataset for the IT job market in Iran. This paper presents IranITJobs2021, an online IT job posting dataset produced using the proposed dataset collection process. IranITJobs2021 includes information technology job advertisements from August 2019 to January 2021. The dataset comprises 1300 instances and 13 features and is publicly available. IranITJobs2021 can be analyzed to find valuable patterns in job requirements and skills in the field of information technology. Furthermore, the proposed dataset collection process can be applied to create datasets efficiently.