Meeting: What’s New in Solr 5 Security & Solr Custom Collector: The Anti-Score

The arrival of Apache Solr 5 has brought with it a number of features. In these talks you will see some of the advantages of using Solr as your search engine.

Solr Meetup on Tuesday, August 11th in downtown Seattle discussing two topics on Solr :

  • What’s new in Solr 5 Security. Presented by Anshum Gupta, Lucidworks. Apache Solr has evolved into a highly scalable system, capable of handling a lot of data and high number of queries, but only recently was a mechanism to secure access in Solr provided. Apache Solr 5.2 shipped with pluggable authentication and authorization modules. These modules enable users to write their own plugins for managing security in Solr.

    This talk will cover an overview of both the authentication and authorization frameworks, and how they work together within Solr. It will also provide an overview of existing plugins and how to enable them to restrict user access to resources within Solr.

  • Solr Custom Collector: The Anti-Score. Presented by Michael Kosten, Getty Images. Sometimes, you don’t want to return just the top scoring documents as your search results.  If you have an eCommerce site, you may want to ensure that multiple lines of business are represented. If you incorporate customer interaction in your score, you may want to ensure that newer documents or certain categories are still represented and that your results don’t become stale. This requirement could be handled in middleware that post processes the search results, by requesting extra rows and rearranging them or by interleaving multiple queries. A better solution is to implement your own custom collector in Solr, so that search results can be arranged in any order. Michael will demonstrate a solution that returns top scoring documents, but grouped within categories. For example, a search for books could interweave the best fiction and non-fiction in a single query result. He will also demonstrate how to implement a custom priority queue to reduce memory requirements if there are many categories, and how the custom collector can be integrated into Solr without modifying the base distribution.

How to register for the talk : http://www.meetup.com/Seattle-Solr-Lucene-Meetup/events/223899316/

Recommended Reading/Viewing: Apache Search Engines (Lucene/Solr/ElasticSearch) Presentations

One aspect of most architectures on the web involve search.  I’ve been working with Apache Solr – a search engine that as it turns out fits the definition of NoSQL database (and some people use it as a read database).  If you don’t know what Solr is there are plenty of books worth reading on the topic.  And you should visit the Apache Solr site :

apache-solr

A commercially supported version of Solr is available from LucidWorks. If you are interested in learning a lot more, there are some nice videos from the Lucene Solr Revolution 2013 conference. Lots of very good talks, select the following ‘Recommended Viewing’ image to go to the conference video/PDF presentations :

lucene-solr-conf2013 A taste of some of the talks – first on the topic of ‘search’ itself.

and second, a talk on Solr as a NoSQL database from the creator of Solr  :

Another interesting search engine project is ElasticSearch.