Data Creation and Collection for Artificial Intelligence via Crowdsourcing

Self-paced course

Certification program

Overview

Advances in Artificial Intelligence and Machine Learning have led to technological revolutions. Yet the AI systems at the forefront of these innovations have been the center of growing concerns. These include reports of system failure when conditions differ only slightly from those seen during training, as well as ethical and societal questions raised by the use of such systems.

Machine learning models have been criticized for lacking robustness, fairness and transparency. Such model-related problems can, to a large extent, be attributed to issues with data. To learn comprehensive, fine-grained and unbiased patterns, models have to be trained on a large number of high-quality data instances whose distribution accurately represents real application scenarios. Creating such data is not only a long, laborious and expensive process, but sometimes even impossible when the data is extremely imbalanced or the distribution constantly evolves over time.

This course introduces crowdsourcing, an important method for gathering data to train machine learning models and build AI systems. Crowdsourcing offers a viable means of leveraging human intelligence at scale for data creation, enrichment and interpretation, with great potential to improve the performance of AI systems and increase the wider adoption of AI in general.

By the end of this course you will be able to understand and apply crowdsourcing methods to elicit human input as a means of gathering high-quality data for machine learning. You will be able to identify biases that datasets acquire from the way they are gathered or created, and choose among task design options to optimize data quality. These skills are essential for careers in Data Science, Machine Learning, and the broader field of Artificial Intelligence.

At the end of this course you will be able to:

  • Examine the use of crowdsourcing for gathering data
  • Explain how cognitive biases and other human factors influence data quality
  • Describe the use of active learning in the creation of crowdsourced training data
  • Demonstrate the design of crowdsourcing tasks with quality control mechanisms
  • Discuss the evaluation of ML models with humans in the loop
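
As an illustration of the quality control mechanisms mentioned in the outcomes above, one common approach is to ask several workers to label each item and then aggregate their answers by majority vote, flagging items where workers disagree. The following is a minimal Python sketch under that assumption, not material from the course itself; the function name `aggregate_labels`, the `min_agreement` threshold and the sample annotations are all hypothetical.

```python
from collections import Counter


def aggregate_labels(annotations, min_agreement=0.6):
    """Majority-vote the crowd labels of each item.

    annotations: dict mapping an item id to the list of labels that
    different workers assigned to it (hypothetical input format).
    min_agreement: assumed threshold; items whose winning label has a
    smaller share of the votes are flagged for manual review.
    """
    results = {}
    for item_id, labels in annotations.items():
        if not labels:
            # Nothing to aggregate for items nobody has labeled yet.
            continue
        counts = Counter(labels)
        top_label, top_count = counts.most_common(1)[0]
        agreement = top_count / len(labels)
        results[item_id] = {
            "label": top_label,
            "agreement": agreement,
            "needs_review": agreement < min_agreement,
        }
    return results


# Hypothetical annotations: three workers per image.
annotations = {
    "img_1": ["cat", "cat", "dog"],
    "img_2": ["dog", "cat", "bird"],
}
results = aggregate_labels(annotations)
for item_id, info in results.items():
    print(item_id, info)
```

Here `img_1` is accepted with the label `cat` because two of three workers agree, while `img_2` is flagged for review because no label has a majority. Real pipelines often go further, for example by weighting workers according to their accuracy on known-answer questions, but plain majority voting is a reasonable baseline.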

374 students

42 Days

English

Beginner