Website Scraping with Python: Using BeautifulSoup and Scrapy
Closely examine website scraping and data processing: the technique of extracting data from websites in a format suitable for further analysis. You'll review which tools to use, and compare their features and efficiency. Focusing on BeautifulSoup4 and Scrapy, this concise, focused book highlights common problems and suggests...
Build cool NLP and machine learning applications using NLTK and other Python libraries
About This Book
Extract information from unstructured data using NLTK to solve NLP problems
Analyse linguistic structures in text and learn the concept of semantic analysis and parsing
Spidering Hacks Written for developers, researchers, technical assistants, librarians, and power users, Spidering Hacks provides expert tips on spidering and scraping methodologies. You'll begin with a crash course in spidering concepts, tools (Perl, LWP, out-of-the-box utilities), and ethics (how to know when you've gone too far:...