index groupwise archive with solr - solr

I would like to learn if anyone has successfully indexed GroupWise archive mail into Apache Solr/lucene?
Looking for assistance on setting up a fetch.
Thank you.

By Default, SOLR comes with clustering module i.e. carrot2
http://wiki.apache.org/solr/ClusteringComponent
Try this module and it will solve your issue.

Related

Solr Exception: Sort param field can't be found

I've been trying to set up CKAN, however I am facing some problems in SOLR.
Everytime I run CKAN, the SOLR log file registers a new event, and that is:
org.apache.solr.common.SolrException: sort param field can't be found: metadata_modified
I am trying to use CKAN for the first time and I have no experience at all, so I have no idea what that log event means, nor how to fix it.
EDIT:
When I reload the core at SOLR, the following is logged:
The schema has been upgraded to managed,​ but the non-managed schema schema.xml is still loadable. PLEASE REMOVE THIS FILE.
Could anyone help me?
Many thanks.
Well, turns out the SOLR 6, for some reason, was the problem.
Downgrading to version 5 worked for me.
This sounds like you are not using CKAN's custom Solr schema. Make sure to go over all points in the setup documentation, specially point 2, and to restart jetty afterwards:
sudo service restart jetty

How to index data using solr from another server?

I have some .xml file on window server. And my solr is running on another ubuntu server.
I want to index these files using solr. How to do this? plz help me to resolve this issue.
You can use the DataImport tool of SOLR which helps you to easily import an XML from Solr's web interface.
see the example of wikipedia here

Any way to see the inverted index of a doc in SOLR

I am wondering if there is a way to look at the inverted index of a doc in SOLR? I checked solr admin tool but couldn't find anything.
Check the LukeRequestHandler within Solr, should enable you to get the index data.
there is Luke but lately is is not being upgraded to latest lucene version, someone will recompile it soon probably.

How to do indexing data from database using apache solr with glassfish server on linux?

I want to create a search box in my web app using Apache Lucene and Apache Solr.I am using postgres database and have to do it with java.
As I new to these concepts (solr,lucene), I am struggling with this. I already installed and configured apache Solr with glassfish.Now I dont know how to start with this, Whether I have to cretae a java project in eclipse or I have to use Solr admin GUI.
can any one help me on this?
Thanks in Advance.....
In order to make data searchable, you have to first index your data. You can use one of the following ways to index data.
By using Solr clients such as Solrj
If you store your data in relational DB then you can use DataImportHandler
By posting XML or Json messages. Check here for documentation.
When new data added you can index it using Solr clients (Solrj). You can also search your data using Solrj or any other client libraries.
You can find other client libraries here.
You can start with Solr DIH to index the data from postgres to Solr.
For more detailed understanding you can refer to :-
how-to-import-data-from-sql-databases-part-1
how-to-import-data-from-sql-databases-part-2
how-to-import-data-from-sql-databases-part-3

Query Apache Solr for metadata

I was wondering if it was possible to query Solr for some index-metadata, e.g. the most frequent words in the index?
If so, how do you do this? Unfortunately I wasn't able to find anything about this.
thx for any help in advance
Have a look at the LukeRequestHandler

Resources