[GeoNetwork-devel] Why id uuid tokenized

Any idea why this is? It doesn’t make much sense to me, but maybe I am missing something.

Jesse

I suspect it is being tokenized because the a-f in the hex numbers can be upper or lower case - so tokenizing the uuid on indexing and any uuid used as a search term makes all the comparisons in the same case. Maybe using a simple lower case analyzer as the default here rather than the GeoNetworkAnalyzer (=whitespace, stopwords and lower case) would be marginally simpler?

________________________________________
From: Jesse Eichar [jesse.eichar@anonymised.com]
Sent: Thursday, 13 December 2012 7:01 AM
To: Devel geonetwork-devel@lists.sourceforge.net; Jose Garcia; Francois Prunayre; Pigot, Simon (CMAR, Hobart); heikki doeleman
Subject: Why id uuid tokenized

Any idea why this is? It doesn't make much sense to me, but maybe I am missing something.

Jesse

Hi

I think was mainly to lowercase the content. Simon is right that LowerCaseAnalyzer should be better to use.

Regards,
Jose García

On Thu, Dec 13, 2012 at 1:26 AM, <Simon.Pigot@anonymised.com> wrote:

I suspect it is being tokenized because the a-f in the hex numbers can be upper or lower case - so tokenizing the uuid on indexing and any uuid used as a search term makes all the comparisons in the same case. Maybe using a simple lower case analyzer as the default here rather than the GeoNetworkAnalyzer (=whitespace, stopwords and lower case) would be marginally simpler?


From: Jesse Eichar [jesse.eichar@…189…]
Sent: Thursday, 13 December 2012 7:01 AM
To: Devel geonetwork-devel@lists.sourceforge.net; Jose Garcia; Francois Prunayre; Pigot, Simon (CMAR, Hobart); heikki doeleman
Subject: Why id uuid tokenized

Any idea why this is? It doesn’t make much sense to me, but maybe I am missing something.

Jesse


GeoCat Bridge for ArcGIS allows instant publishing of data and metadata on GeoServer and GeoNetwork. Visit http://geocat.net for details.


Jose García
GeoCat bv
Veenderweg 13
6721 WD Bennekom
The Netherlands
http://GeoCat.net

Hi All,

According to my vague memory of the standard, the letters had to be
either upper case or lower case, which, I can't remember. I think that
ESRI was using the wrong case and may have been a reason for the
tokenized values. I suggest that GeoNetwork use the correct case and
convert into that format.

John Hockaday

On Thu, 2012-12-13 at 08:29 +0100, Jose Garcia wrote:

Hi

I think was mainly to lowercase the content. Simon is right that
LowerCaseAnalyzer should be better to use.

Regards,
Jose García

On Thu, Dec 13, 2012 at 1:26 AM, <Simon.Pigot@anonymised.com> wrote:

> I suspect it is being tokenized because the a-f in the hex numbers can be
> upper or lower case - so tokenizing the uuid on indexing and any uuid used
> as a search term makes all the comparisons in the same case. Maybe using a
> simple lower case analyzer as the default here rather than the
> GeoNetworkAnalyzer (=whitespace, stopwords and lower case) would be
> marginally simpler?
>
>
> ________________________________________
> From: Jesse Eichar [jesse.eichar@anonymised.com]
> Sent: Thursday, 13 December 2012 7:01 AM
> To: Devel geonetwork-devel@lists.sourceforge.net; Jose Garcia; Francois
> Prunayre; Pigot, Simon (CMAR, Hobart); heikki doeleman
> Subject: Why id uuid tokenized
>
> Any idea why this is? It doesn't make much sense to me, but maybe I am
> missing something.
>
> Jesse
>

------------------------------------------------------------------------------
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d
_______________________________________________ GeoNetwork-devel mailing list GeoNetwork-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/geonetwork-devel GeoNetwork OpenSource is maintained at http://sourceforge.net/projects/geonetwork