Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Instant Nokogiri

Buy
Instant Nokogiri, 9781783289974 (178328997X), Packt Publishing, 2013

Learning data scraping and parsing in Ruby using the Nokogiri gem

Overview

  • Learn something new in an Instant! A short, fast, focused guide delivering immediate results
  • Master Nokogiri with the use of clear, step-by-step instructions and real world examples
  • Learn how to identify sources, parse documents, and extract information from them
  • Use the interactive Ruby shell and the features of Nokogiri to test and refine your theories in real-time

In Detail

A wealth of information sits waiting on the Internet. Instant Nokogiri helps you access this information today with Nokogiri, a slick and fast HTML and XML parsing engine. Bundled in an easy-to-use Ruby gem, Nokogiri empowers you to combine disparate data sources and gain an unprecedented insight into your Ruby applications.

"Instant Nokogiri" is a hands-on guide to extracting information from the sources available on the Internet, sources that are not traditionally accessible to developers. You will learn the secrets of identifying content, extracting just the right parts, and incorporating the new data in your Ruby applications.

"Instant Nokogiri" provides step-by-step instructions on how to incorporate the power of the Nokogiri gem and data parsing into your Ruby projects. You will learn all the basics of designing a project around data parsing, exploring disparate data sources, and refining strategies and theories. You will also combine your thoughts in a real-world, real-data sample application. This book will examine common Nokogiri and Ruby methods useful in scraping and parsing complete with practical code samples. You will also learn the secrets behind effective caching, rate limiting, and masking your identity. Instant Nokogiri will teach you how to get targeted data out of HTML and into Ruby, as well as tons of tips, tricks, code snippets, and expert advice.

What you will learn from this book

  • Set up a development environment for Nokogiri
  • Know when to use a parsing engine
  • Identify ideal sources from which to extract content and devise optimal strategies for selecting content
  • Use CSS and XPath selectors to target content
  • Test your theories in an interactive Ruby shell
  • Work with live web data
  • Avoid detection and be a good netizen
  • Incorporate your finished snippets in a full Sinatra application

Approach

Get to grips with a new technology, understand what it is and what it can do for you, and then get to work with the most important features and tasks. A concise, illustrated guide to extracting information available on the Internet using Nokogiri.

Who this book is written for

"Instant Nokogiri" is the perfect choice for the aspiring Ruby developer looking to incorporate screen scraping and parsing technology in their applications. Beginner level Ruby, basic HTML, and CSS experience is suggested.

(HTML tags aren't allowed.)

FrontPage 2003 (The Missing Manual)
FrontPage 2003 (The Missing Manual)
In today's highly connected world, almost everybody has a web site, from local sewing circles to the world's largest corporations. If you're ready for one of your own, Microsoft's FrontPage 2003 has everything you need to create Web pages. It's true. Your geek friends may howl in contempt if you use FrontPage, but that's because the program has a...
Solving Everyday Problems with the Scientific Method: Thinking Like a Scientist
Solving Everyday Problems with the Scientific Method: Thinking Like a Scientist

This book describes how one can use The Scientific Method to solve everyday problems including medical ailments, health issues, money management, traveling, shopping, cooking, household chores, etc. It illustrates how to exploit the information collected from our five senses, how to solve problems when no information is available for the...

Handbook of X-Ray Data
Handbook of X-Ray Data
This sourcebook is intended as an X-ray data reference for scientists and engineers working in the field of energy or wavelength dispersive X-ray spectrometry and related fields of basic and applied research, technology, or process and quality controlling. In a concise and informative manner, the most important data connected with the emission...

Wiki: Web Collaboration
Wiki: Web Collaboration
Wikis are Web-based applications that allow all users not only to view pages but also to change them. The success of the Internet encyclopedia Wikipedia has drawn increasing attention from private users, small organizations and enterprises to the various possible uses of wikis.

Their simple structure and straightforward operation make them a...

A Classical Introduction to Cryptography Exercise Book
A Classical Introduction to Cryptography Exercise Book
This companion exercise and solution book to A Classical Introduction to Cryptography: Applications for Communications Security contains a carefully revised version of teaching material used by the authors and given as examinations to advanced-level students of the Cryptography and Security Lecture at EPFL from 2000 to mid-2005. A Classical...
CEA-CompTIA DHTI+ Digital Home Technology Integrator All-In-One Exam Guide, Second Edition
CEA-CompTIA DHTI+ Digital Home Technology Integrator All-In-One Exam Guide, Second Edition
A CEA-CompTIA DHTI+ Exam Guide and Desktop Reference--All in One!.

Get complete coverage of all the material included on the CEA-CompTIA DHTI+ Digital Home Technology Integrator exam inside this comprehensive resource. Written by industry experts, this definitive exam guide features learning objectives at the beginning of each...

©2021 LearnIT (support@pdfchm.net) - Privacy Policy