In 2.7 (with log level set to DEBUG for log4j.logger.geonetwork.harvester in web/geonetwork/WEB-INF/log4j.cfg) I can see the harvester parsing the GetCapabilities statement returned by the ESRI Geoportal (9.3) software and then requesting the records using GetRecordById something like as follows:
http://gis.ncdc.noaa.gov/geoportal/csw/discovery?request=GetRecordById&service=CSW&version=2.0.2&outputSchema=http%3A%2F%2Fwww.opengis.net%2Fcat%2Fcsw%2F2.0.2&elementSetName=full&id=gov.noaa.cpc%3Aepachurr
The response from the ESRI Geoportal to this request is an exception:
<?xml version="1.0" encoding="UTF-8"?>
<ExceptionReport version="1.2.0" xmlns="http://www.opengis.net/ows">
<Exception exceptionCode="InvalidParameterValue" locator="ElementSetName">
<ExceptionText>
ElementSetName is not valid for 'original' outputScheme
</ExceptionText>
</Exception>
</ExceptionReport>
This response seems to me to be a trifle odd as both the outputSchema and the elementSetName used by GeoNetwork are advertised in the Geoportal's GetCapabilities statement in the GetRecordById operation section!
Fiddling with the GetRecordById request outside of GeoNetwork, I can get the record if I drop the outputSchema, and ElementSetName works just fine with brief and summary as well as full. Even more odd is that 'original' is also listed as an outputSchema and when used as an outputSchema it works correctly by returning a dc metadata record.
Seems like it would help if these sites would upgrade to the open source Geoportal (see interop_matrix – GeoNetwork opensource Developer website) or, better still, use GeoNetwork instead :-). Neither of those suggestions are very helpful for overcoming the immediate problem though.
Cheers,
Simon
________________________________________
From: ByronCinNZ [cochranes4@anonymised.com]
Sent: Thursday, 30 June 2011 7:50 AM
To: geonetwork-devel@lists.sourceforge.net
Subject: [GeoNetwork-devel] Harvesting from ESRI Geoportal
Hello,
I am testing the ability of GeoNetwork to harvest from sites that use the
ESRI Geoportal Toolkit to present their data via CSW. I am mostly working in
GN 2.6.4 but an also testing against GN 2.7. In GN 2.7 I get a successful
connection with records matched but all being unretrievable. In GN 2.6.4 I
just get an error about faulty constraintLanguage parameter (It needs to be
FILTER).
I am testing against the ESRI supported service
http://gptogc.esri.com/geoportal/csw and the NOAA site
http://gis.ncdc.noaa.gov/geoportal/csw/discovery.
The particulars of my testing inputs are:
Service Url -
http://gis.ncdc.noaa.gov/geoportal/csw/discovery?Request=GetCapabilities&Service=CSW&Version=2.0.2
and
http://gptogc.esri.com/geoportal/csw?request=GetCapabilities&service=CSW&version=2.0.2
Search Criterea / Free Text - %snow%
This should bring back 6 records for either site judging from the results I
get when querying directly on these sites, but I get nothing. In GN 2.7 it
finds 14 records but marks them all as "unretrievable".
In GN 2.6 I get errors about the constraintLanguage parameter being
incorrect. Geoportal only can accept "Filter" as a valid parameter value
here, but I do not know how to make sure that the GN harvester is sending
this parameter.
I tested these sites using the Firefox REST Client I got successful
responses using these parameters:
/<csw:GetRecords xmlns:csw="http://www.opengis.net/cat/csw/2.0.2"
version="2.0.2" service="CSW" resultType="results" > <csw:Query
typeNames="csw:Record" xmlns:ogc="http://www.opengis.net/ogc"
xmlns:gml="http://www.opengis.net/gml">
<csw:ElementSetName>full</csw:ElementSetName> <csw:Constraint
version="1.1.0"> <ogc:Filter> <ogc:And> <ogc:PropertyIsLike wildCard="*"
escape="\" singleChar="?"> <ogc:PropertyName>AnyText</ogc:PropertyName>
<ogc:Literal>Snow</ogc:Literal> </ogc:PropertyIsLike> </ogc:And>
</ogc:Filter> </csw:Constraint> </csw:Query> </csw:GetRecords>/
Can anyone give me some guidance as to what I should do to successfully
retrieve these records using the GeoNetwork CSW harvesting mechanism?
Cheers,
Byron
--
View this message in context: http://osgeo-org.1803224.n2.nabble.com/Harvesting-from-ESRI-Geoportal-tp6530845p6530845.html
Sent from the GeoNetwork developer mailing list archive at Nabble.com.
------------------------------------------------------------------------------
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security
threats, fraudulent activity, and more. Splunk takes this data and makes
sense of it. IT sense. And common sense.
_______________________________________________
GeoNetwork-devel mailing list
GeoNetwork-devel@lists.sourceforge.net
GeoNetwork OpenSource is maintained at http://sourceforge.net/projects/geonetwork