Apache Solr: A Practical Approach to Enterprise Search

  • 5h 12m
  • Dikshant Shahi
  • Apress
  • 2015

Apache Solr: A Practical Approach to Enterprise Search teaches you how to build an enterprise search engine using Apache Solr. You'll soon learn how to index and search your documents; ingest data from varied sources; pre-process, transform and enrich your data; and build the processing pipeline.

You will understand the concepts and internals of Apache Solr and learn to tune the results for your client’s search needs. The book explains each essential concept, backed by practical and industry examples, to help you attain expert-level knowledge.

The book, which assumes a basic knowledge of Java, starts with an introduction to Solr, followed by the steps for setting it up, indexing your first set of documents, and searching them. It then covers the end-to-end process of ingesting data from varied sources, pre-processing, transforming, and enriching the data, building the processing pipeline, parsing queries, and scoring documents. It also teaches you how to make your system intelligent and able to learn through feedback loops.

After covering out-of-the-box features, Solr expert Dikshant Shahi dives into ways you can customize Solr for your business and its specific requirements, along with ways to plug in your own components. Most importantly, you will learn to handle user queries and retrieve meaningful results. The book explains how user queries differ and how to handle each kind to get the best results. And because document ranking doesn't work the same for all applications, the book shows you how to tune Solr for the application at hand and re-rank the results.

You'll see how to influence user experience by providing suggestions and recommendations, and leveraging other interesting features of Solr. You'll also see how to integrate Solr with important related technologies like OpenNLP, Apache Tika, and Apache UIMA, among others, to take your search capabilities to the next level.

The book concludes with case studies and industry examples that will help you design components and put the pieces together. By the end of Apache Solr, you will be proficient in designing, architecting, and developing your search engine and be able to integrate it with other systems.

About the Author

Dikshant Shahi manages the search and platforms team at OnMobile Global Limited. He has been responsible for developing several vertical search engines for categories including music metadata, voice, audio fingerprinting, channel intelligence, log file processing, building analytics, and Groupon-style deal finding. He has also been responsible for handling multilingual content, natural language processing, and recommendations. Shahi specializes in search engines, information retrieval, data extraction and analysis, application development, web services, and mobile applications.

In this Book

  • Apache Solr: An Introduction
  • Solr Setup and Administration
  • Information Retrieval
  • Schema Design and Text Analysis
  • Indexing Data
  • Searching Data
  • Searching Data: Part 2
  • Solr Scoring
  • Additional Features
  • Traditional Scaling and SolrCloud
  • Semantic Search