DADS404 DATA SCRAPPING

Description

SESSION: July 2023
PROGRAM: MASTER OF BUSINESS ADMINISTRATION (MBA)
SEMESTER: IV
COURSE CODE & NAME: DADS404 – DATA SCRAPPING
CREDITS: 04
NUMBER OF ASSIGNMENTS & MARKS: 02 (30 MARKS EACH)

 

Assignment Set – 1

 

1 a. Define data scraping and explain its significance in the digital age.

1 b. List and briefly describe three tools used for data scraping. Discuss one advantage of each.

 

2 a. Outline the ethical considerations a data scraper must keep in mind. Why is respecting robots.txt important?

 

  

3 a. Given a sample website structure, identify the potential challenges in scraping data and suggest solutions.

3 b. Explain the process of manual scraping using Python. Include a brief code snippet as an example.

 

Assignment Set – 2

 

1 a. Explain what API-based scraping is. Why is it often preferred over traditional web scraping methods?

1 b. Describe how rate limits and authentication mechanisms work in API-based scraping, giving an example of a popular API that employs these

 

2 b. Using Twitter as an example, discuss how you would access and scrape data using R. What challenges might arise?

 

 

 

3 a. Describe the process of scraping cryptocurrency data. Highlight the importance of putting this data in a standard format and the potential challenges in doing so.

3 b. Define ‘Data Quality’. Discuss two dimensions of data quality and explain how automated data quality checks can be beneficial.

 

Additional information

Assignment Type

General, Unique
