Home | Amazing | Today | Tags | Publishers | Years | Account | Search 
Tika in Action

Buy
Tika in Action, 9781935182856 (1935182854), Manning Publications, 2011

Summary

Tika in Action is a hands-on guide to content mining with Apache Tika. The book's many examples and case studies offer real-world experience from domains ranging from search engines to digital asset management and scientific data processing.

About the Technology

Tika is an Apache toolkit that has built into it everything you and your app need to know about file formats. Using Tika, your applications can discover and extract content from digital documents in almost any format, including exotic ones.

About this Book

Tika in Action is the ultimate guide to content mining using Apache Tika. You'll learn how to pull usable information from otherwise inaccessible sources, including internet media and file archives. This example-rich book teaches you to build and extend applications based on real-world experience with search engines, digital asset management, and scientific data processing. In addition to architectural overviews, you'll find detailed chapters on features like metadata extraction, automatic language detection, and custom parser development.

This book is written for developers who are new to both Scala and Lift and covers just enough Scala to get you started.

Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.

What's Inside
  • Crack MS Word, PDF, HTML, and ZIP
  • Integrate with search engines, CMS, and other data sources
  • Learn through experimentation
  • Many examples

This book requires no previous knowledge of Tika or text mining techniques. It assumes a working knowledge of Java.

(HTML tags aren't allowed.)

Developing Time-Oriented Database Applications in SQL
Developing Time-Oriented Database Applications in SQL
What is everywhere but occupies no space; can be measured but not seen or touched; can be spent, wasted, or killed, but not destroyed or changed? The simple answer is time, and the relationship between the fourth dimension and data is the foundation for Developing Time-Oriented Database Applications in SQL, a fascinating book by Richard T....
CMOS Multichannel Single-Chip Receivers for Multi-Gigabit Optical Data Communications (Analog Circuits and Signal Processing)
CMOS Multichannel Single-Chip Receivers for Multi-Gigabit Optical Data Communications (Analog Circuits and Signal Processing)
While the throughput of microprocessor systems tends to increase as a result of ongoing technology scaling and the advent of multi-core systems, the off-chip I/O communication bandwidth emerges as one of the potential bottlenecks that limit overall performance. In order to alleviate the communication speed constraints, optical data communication...
CCNP Practical Studies: Remote Access (CCNP Self-Study)
CCNP Practical Studies: Remote Access (CCNP Self-Study)

Gain hands-on experience of CCNP Remote Access topics with lab scenarios for the new 642-821 BCRAN exam.

  • Prepare for the CCNP 642-821 BCRAN exam and gain a better, practical understanding of exam concepts

  • Experience how remote access...


Seven Deadliest USB Attacks
Seven Deadliest USB Attacks

Do you need to keep up with the latest hacks, attacks, and exploits effecting USB technology? Then you need Seven Deadliest USB Attacks. This book pinpoints the most dangerous hacks and exploits specific to USB, laying out the anatomy of these attacks including how to make your system more secure. You will discover the best ways to defend...

Professional iPhone Programming with MonoTouch and .NET/C#
Professional iPhone Programming with MonoTouch and .NET/C#

What .NET C# developers need to enter the hot field of iPhone apps

iPhone applications offer a hot opportunity for developers. Until the open source MonoTouch project, this field was limited to those familiar with Apple’s programming languages. Now .NET and C# developers can join the party. Professional iPhone...

MCSE Self-Paced Training Kit (Exam 70-298): Designing Security for a Microsoft® Windows Server(TM) 2003 Network
MCSE Self-Paced Training Kit (Exam 70-298): Designing Security for a Microsoft® Windows Server(TM) 2003 Network

Ace your preparation for the skills measured by MCP Exam 70-298—and on the job—with this official Microsoft study guide. Work at your own pace through a system of lessons, practice exercises, and design activities.

Maximize your performance on the exam by learning how to:

  • Document the impact of business and...
©2020 LearnIT (support@pdfchm.net) - Privacy Policy