Solr 101
Get started with Apache Solr, a scalable search engine. Learn how to index, search, and analyze data for enterprise-level search applications.
At a Glance
This course teaches you the basics of how to use Solr. Solr (pronounced “solar”) is an open source enterprise search platform, written in Java, from the Apache Lucene project.
ABOUT THIS COURSE
Learn the basics of Solr (pronounced “solar”), an open source enterprise search platform, written in Java, from the Apache Lucene project.
Solr is a standalone full-text search server that uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it usable from most popular programming languages.
- Learn about Solr’s major features, including full-text search, hit highlighting, faceted search, real-time indexing, dynamic clustering, database integration, NoSQL features and rich document (e.g., Word, PDF) handling.
- Learn how Solr is highly scalable and fault tolerant in providing distributed search and index replication.
- Learn why Solr is the most popular enterprise search engine.
COURSE SYLLABUS
- Module 1 – Search Engines
- Understand the importance of text search engines
- Understand the Solr search procedure
- Identify Solr components
- Module 2 – Configure and Add Documents to Solr
- Identifying the important files in a Solr installation
- Define the schema for documents in the index
- Understand the various ways to add documents to Solr
- Module 3 – Analyzers and Queries
- Use analyzers, tokenizers, and filters
- Construct queries
- Module 4 – SolrJ and Customization
- Create SolrJ applications
- Understand the customization options available in Solr
GENERAL INFORMATION
- This course is self-paced.
- It can be taken at any time.
- It can be audited as many times as you wish.
- Labs can be performed on the Cloud, or using a 64-bit system. If using a 64-bit system, you can install the required software (Linux-only), or use the supplied VMWare image. More details are provided in the section “Labs setup”.
RECOMMENDED SKILLS PRIOR TO TAKING THIS COURSE
- Basic knowledge of operating systems (UNIX/Linux).
- Basic understanding of SQL and Java would be helpful.
REQUIREMENTS
- None
COURSE STAFF
James Priebe
James Priebe is an IBM intern located in Toronto, Ontario. He spends his time creating proof of concept applications for IBM business partners and developing courses for customer education. He has worked with a variety of technologies in Big Data family, including Streams, Hadoop, and Annotation Query Language (AQL). James is from McMaster University, where he has completed his third year of the Software Engineering & Management program.
There are no reviews yet.