Scraping & APIs

Bradshaw, Paul. Scraping for Journalists. Leanpub, 2017

Bradshaw, Paul. What Data Journalists Need to Know About APIs. GIJN, 2022

Carter, Laura. There is always an element of judgement. Datakind UK

Google. Robots.txt files overview

Gold, Zachary & Latonero, Mark. Robots Welcome? Ethical and Legal Considerations for Web Crawling and Scraping. Washington Journal of Law, Technology & Arts. 13/3, 2018, p. 275. Library resource

Golumbia, David. Fair Game: commonly used by researchers and journalists, data scraping is an underacknowledged privacy concern. 2022

Harlow, Max. Fetch and enrich data with APIs.

Heydt, Michael. Scraping. Code of conduct. Python web scraping cookbook, Packt, 2018. Library resource

Jarmul, Katherine & Lawson, Richard. Python web scraping: fetching data from the web. Packt, 2nd ed., 2017. Library resource

Kouzis-Loukas, Dimitrios. Learning Scrapy: learn the art of efficient web scraping and crawling with Python. Packt, 2016. Library resource

McCarthy, Kieran. Web scraping for me, but not for thee. 2023.

Mitchell, Ryan. Web scraping with Python. O’Reilly, 2nd ed., 2018. Library resource

Mitchell, Ryan. Legalities & ethics of web scraping (p. 265-79); Note on ethics (p. 217-18). Web scraping with Python. O’Reilly, 2nd ed., 2018. Library resource

Ni, Daniel. Five Tips for web scraping without getting booted. 2019

ONS. Web scraping policy

Schacht, Kira. A web scraping toolkit for journalists, 2019

Scrapinghub. Web Scraping Best Practices Guide

Sellars, Andrew. Twenty years of web scraping and the Computer Fraud and Abuse Act. Boston University Journal of Science & Technology Law. 24, 2018

Shiab, Nael. On the ethics of web scraping. GIJN, 2015

Shiab, Nael. Web scraping. A journalist’s guide. GIJN, 2015

Smith, Madolyn. APIs for journalism. Datajournalism.com

Sweigart, Al. “Web scraping”, Automate the boring stuff, No Starch Press, 2nd ed., 2020, p. 267-300.

The Markup. Why web scraping is vital to democracy. 2020

Velotio. Scraping guidelines & best practices