[GeoNetwork-devel] Fuzzy search prototype

Hi list, for the time being, GN is using exact TermQuery but looks a bit too precise to search sometimes.

I’ve made a change to be able to use the FuzzyQuery of Lucene. This will allow to get more results avoiding typo, accents and special characters. The fuzziness is defined by a parameter named “similarity”.

By default similarity is set to 0.8 (default value could be change on the main-page.xsl). In the advanced search, a simple “scriptaculous” slider allows changing the similarity (from 0 to 1). 1 equal TermQuery. It will be applied to all text field (and not for geospatial Query) :

  • Title,
  • Abstract,
  • Free text,
  • Keywords

Demo (based on CVS Head) is available :
http://dev.sandre.eaufrance.fr/geonetworkalpha/srv/en/main.home

Search for phisiographic, phisyographic will get the “Physiographic Map of North and Central Eurasia”, africà, …etc.

Any remarks, bugs, browser compatibility issue ?

Cheers. Francois


Ce message a ete verifie par MailScanner pour des virus ou des polluriels et rien de suspect n’a ete trouve.

Les donnees et renseignements contenus dans ce message sont personnels, confidentiels et privés.Toute publication, utilisation ou diffusion, meme partielle, doit etre autorisee.
Any data and information contained in this electronic mail is personal, confidential and private. Any total or partial publication, use or distribution must be authorized.

Hi Francois,

if you send me a diff against geonetwork 2.0.2 I can add your patch.
Just a diff -r -c -x CVS

Cheers,
Andrea

Hi list, for the time being, GN is using exact TermQuery but looks a bit too
precise to search sometimes.

I've made a change to be able to use the FuzzyQuery of Lucene. This will
allow to get more results avoiding typo, accents and special characters. The
fuzziness is defined by a parameter named "similarity".

By default similarity is set to 0.8 (default value could be change on the
main-page.xsl). In the advanced search, a simple "scriptaculous" slider
allows changing the similarity (from 0 to 1). 1 equal TermQuery. It will be
applied to all text field (and not for geospatial Query) :
- Title,
- Abstract,
- Free text,
- Keywords

Demo (based on CVS Head) is available :
http://dev.sandre.eaufrance.fr/geonetworkalpha/srv/en/main.home

Search for phisiographic, phisyographic will get the "Physiographic Map of
North and Central Eurasia", africà, ...etc.

Any remarks, bugs, browser compatibility issue ?

Cheers. Francois