[GeoNetwork-devel] repository items and sitemaps

I had a request today to produce all items in my GN repository
referenced in a sitemap.xml file [1].

I think it would be valuable to be able to (optionally) expose GN
repository resources with the sitemap.xml format. Something like:

(http://devgeo.cciw.ca/geonetwork/srv/en/portal.sitemap?format=xml)

...where format can control XML (FGDC/ISO/DC/etc.) or HTML output.

...can return a sitemap document which folks can submit to search engine
crawlers.

Comments?

..Tom

[1] http://www.sitemaps.org/protocol.php

+1

We must expose them to the crawlers. Good idea, Tom. How hard is this
to implement?

Eduin Yesid Carrillo Vega
Ingeniero Civil, Esp. Sistemas de Información Geográfica
Grupo IDE & GIG
Centro de Investigación y Desarrollo en Información Geográfica
Instituto Geográfico Agustín Codazzi - IGAC
Cr 30 48-51
Tel +57 1 3694000 ext 4305
Bogotá D.C., Colombia

On Tue, Apr 14, 2009 at 2:18 PM, Kralidis,Tom [Ontario]
<Tom.Kralidis@anonymised.com> wrote:

I had a request today to produce all items in my GN repository
referenced in a sitemap.xml file [1].

I think it would be valuable to be able to (optionally) expose GN
repository resources with the sitemap.xml format. Something like:

(http://devgeo.cciw.ca/geonetwork/srv/en/portal.sitemap?format=xml)

...where format can control XML (FGDC/ISO/DC/etc.) or HTML output.

...can return a sitemap document which folks can submit to search engine
crawlers.

Comments?

..Tom

[1] http://www.sitemaps.org/protocol.php

I implemented this in Python. Basic steps:

- query the database with "select id, schemaId, changeDate from Metadata where isTemplate = 'n'"
- return XML like:

<urlset
xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
xmlns:geo="http://www.google.com/geo/schemas/sitemap/1.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9 http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd">
<url>
  <loc>http://host/geonetwork/srv/en/fgdc.xml?id=194</loc>
  <lastmod>2009-04-09T15:15:04</lastmod>
  <changefreq>monthly</changefreq>
  <geo:geo>
   <geo:format>fgdc-std</geo:format>
  </geo:geo>
</url>
...
</urlset>

The value of /urlset/url/loc would point to either the XML record (the process would have to look up the record's schemaId and build the URL accordingly) or the HTML view (e.g. http://host/geonetwork/srv/en/metadata.show?id=194).
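
For anyone who wants to try the same thing, here is a rough sketch of the two steps above. It is not the script I actually ran, only an illustration under some assumptions: the rows are the (id, schemaId, changeDate) tuples from the query, with changeDate already a string; the names XML_SERVICES, record_url and build_sitemap are made up for this example; and the only schema-to-service pairing taken from this thread is fgdc-std -> fgdc.xml. Any schema not in the mapping falls back to the metadata.show HTML view.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
GEO_NS = "http://www.google.com/geo/schemas/sitemap/1.0"

# Hypothetical lookup table. Only fgdc-std -> fgdc.xml appears in this
# thread; add entries for other schemas as your catalogue requires.
XML_SERVICES = {"fgdc-std": "fgdc.xml"}


def record_url(base, record_id, schema_id, fmt="xml"):
    """Return (url, is_xml) for one record, falling back to the HTML view."""
    service = XML_SERVICES.get(schema_id) if fmt == "xml" else None
    if service is None:
        return "%s/metadata.show?id=%s" % (base, record_id), False
    return "%s/%s?id=%s" % (base, service, record_id), True


def build_sitemap(rows, base, fmt="xml", changefreq="monthly"):
    """Build a sitemap document from (id, schemaId, changeDate) rows."""
    # Serialize sitemap elements without a prefix and geo elements as geo:*
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("geo", GEO_NS)
    urlset = ET.Element("{%s}urlset" % SITEMAP_NS)
    for record_id, schema_id, change_date in rows:
        loc, is_xml = record_url(base, record_id, schema_id, fmt)
        url = ET.SubElement(urlset, "{%s}url" % SITEMAP_NS)
        ET.SubElement(url, "{%s}loc" % SITEMAP_NS).text = loc
        ET.SubElement(url, "{%s}lastmod" % SITEMAP_NS).text = change_date
        ET.SubElement(url, "{%s}changefreq" % SITEMAP_NS).text = changefreq
        if is_xml:
            # Only advertise a geo format when loc points at the XML record
            geo = ET.SubElement(url, "{%s}geo" % GEO_NS)
            ET.SubElement(geo, "{%s}format" % GEO_NS).text = schema_id
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    rows = [(194, "fgdc-std", "2009-04-09T15:15:04")]
    print(build_sitemap(rows, "http://host/geonetwork/srv/en"))
```

Building the document with ElementTree rather than string concatenation means characters such as & in a URL get escaped for you. The sketch leaves out the xsi:schemaLocation attribute for brevity.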

..Tom

-----Original Message-----
From: Eduin Carrillo [mailto:yecarrillo@anonymised.com]
Sent: Tue 14-Apr-09 15:53
To: geonetwork-devel@lists.sourceforge.net
Subject: Re: [GeoNetwork-devel] repository items and sitemaps

+1

We must expose them to the crawlers. Good idea, Tom. How hard is this
to implement?

_______________________________________________
GeoNetwork-devel mailing list
GeoNetwork-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/geonetwork-devel
GeoNetwork OpenSource is maintained at http://sourceforge.net/projects/geonetwork