[GeoNetwork-users] Harvesting EUMETSAT data

Hi,

I'm trying to harvest the data provided by EUMETSAT's data center. Their
documentation states that their via their product navigator is OGC CSW
2.0.2 compliant and the capabilities can be harvested via
http://navigator.eumetsat.int/soapServices/CSWStartup?service=CSW&version=2.0.2&request=GetCapabilities

I've set this up and the harvest process seems to run with no errors,
but it doesn't get any records. Trying the URL in a browser seems to
return the appropriate XML. Has anybody got this to work, I don't know
what to try next.

--

Andy

--------------------------------------------------------------------------------
Plymouth Marine Laboratory

Registered Office:
Prospect Place
The Hoe
Plymouth PL1 3DH

Website: www.pml.ac.uk
Click here for PML Annual Review
Registered Charity No. 1091222
PML is a company limited by guarantee
registered in England & Wales
company number 4178503

Please think before you print

--------------------------------------------------------------------------------
This e-mail, its content and any file attachments are confidential.

If you have received this e-mail in error please do not copy, disclose it to any third party or use the contents or attachments in any way. Please notify the sender by replying to this e-mail or e-mail forinfo@anonymised.com and then delete the email without making any copies or using it in any other way.

The content of this message may contain personal views which are not the views of Plymouth Marine Laboratory unless specifically stated.

You are reminded that e-mail communications are not secure and may contain viruses. Plymouth Marine Laboratory accepts no liability for any loss or damage which may be caused by viruses.
--------------------------------------------------------------------------------

Hi Andy,

The problem is:

The process of GeoNetwork CSW harvesting is: 1) getRecords 2)parse the getRecords response, get the fileIdentifier information 3) getRecordById based on the fileIdentifier

But in the EUMETSAT, the GetRecords response as below:

......
<gmd:MD_Metadata xmlns:gmd="http://www.isotc211.org/2005/gmd&quot;&gt;
<gmd:fileIdentifier xmlns:eum="http://www.eumetsat.int/2008/gmi&quot; xmlns:gco="http://www.isotc211.org/2005/gco&quot; xmlns:gmi="http://www.isotc211.org/2005/gmi&quot; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance&quot;&gt;&lt;gco:CharacterString&gt;EO:EUM:DAT:MULT:DMPGRAPH&lt;/gco:CharacterString&gt;
</gmd:fileIdentifier>
......

The gco namespace is defined in the fileIdentifier field rather in the MD_Metadata field. Hence, there will be an error to parse it to get the fileIdentifier: XPath error while evaluating "gmd:fileIdentifier/gco:CharacterString":

Our solution is: add the gco namespace to /kernel/harvest/csw/Harvester.java

getRecordInfo() Function

[Available] Kai Liu
  
show details 5:46 PM (5 minutes ago)
  
else if (name.equals("MD_Metadata")) {
            try {
               
                XPath xpath = XPath.newInstance("gmd:fileIdentifier/gco:CharacterString");
                //add gco namespac by cisc.gmu.edu
                xpath.addNamespace("gco", "http://www.isotc211.org/2005/gco&quot;\);
                //System.out.println(Xml.getString(record));
                Element identif = (Element) xpath.selectSingleNode(record);
.....

Cheers,

Kai
Joint Center for Intelligent Spatial Computing
703-395-2337

----- Original Message -----
From: Andy Horseman <anho@anonymised.com>
Date: Wednesday, October 19, 2011 10:55 am
Subject: [GeoNetwork-users] Harvesting EUMETSAT data

Hi,

I'm trying to harvest the data provided by EUMETSAT's data center.
Theirdocumentation states that their via their product navigator is
OGC CSW
2.0.2 compliant and the capabilities can be harvested via
http://navigator.eumetsat.int/soapServices/CSWStartup?service=CSW&version=2.0.2&request=GetCapabilities

I've set this up and the harvest process seems to run with no errors,
but it doesn't get any records. Trying the URL in a browser seems to
return the appropriate XML. Has anybody got this to work, I don't know
what to try next.

--

Andy

--------------------------------------------------------------------
------------
Plymouth Marine Laboratory

Registered Office:
Prospect Place
The Hoe
Plymouth PL1 3DH

Website: www.pml.ac.uk
Click here for PML Annual Review
Registered Charity No. 1091222
PML is a company limited by guarantee
registered in England & Wales
company number 4178503

Please think before you print

--------------------------------------------------------------------
------------
This e-mail, its content and any file attachments are confidential.

If you have received this e-mail in error please do not copy,
disclose it to any third party or use the contents or attachments
in any way. Please notify the sender by replying to this e-mail or
e-mail forinfo@anonymised.com and then delete the email without making
any copies or using it in any other way.

The content of this message may contain personal views which are
not the views of Plymouth Marine Laboratory unless specifically
stated.
You are reminded that e-mail communications are not secure and may
contain viruses. Plymouth Marine Laboratory accepts no liability
for any loss or damage which may be caused by viruses.
--------------------------------------------------------------------
------------

--------------------------------------------------------------------
----------
All the data continuously generated in your IT infrastructure
contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and
makessense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2d-oct
_______________________________________________
GeoNetwork-users mailing list
GeoNetwork-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/geonetwork-users
GeoNetwork OpenSource is maintained at
http://sourceforge.net/projects/geonetwork

Thanks Kai, that makes sense.
I downloaded and built a copy of the GeoNetwork source to test your
suggestion, but the problem seems to have already been fixed in the
latest version.

--

Andy

On 19/10/11 22:54, Kai Liu wrote:

Hi Andy,

The problem is:

The process of GeoNetwork CSW harvesting is: 1) getRecords 2)parse the getRecords response, get the fileIdentifier information 3) getRecordById based on the fileIdentifier

But in the EUMETSAT, the GetRecords response as below:

......
<gmd:MD_Metadata xmlns:gmd="http://www.isotc211.org/2005/gmd&quot;&gt;
<gmd:fileIdentifier xmlns:eum="http://www.eumetsat.int/2008/gmi&quot; xmlns:gco="http://www.isotc211.org/2005/gco&quot; xmlns:gmi="http://www.isotc211.org/2005/gmi&quot; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance&quot;&gt;&lt;gco:CharacterString&gt;EO:EUM:DAT:MULT:DMPGRAPH&lt;/gco:CharacterString&gt;
</gmd:fileIdentifier>
......

The gco namespace is defined in the fileIdentifier field rather in the MD_Metadata field. Hence, there will be an error to parse it to get the fileIdentifier: XPath error while evaluating "gmd:fileIdentifier/gco:CharacterString":

Our solution is: add the gco namespace to /kernel/harvest/csw/Harvester.java

getRecordInfo() Function

[Available] Kai Liu
  
show details 5:46 PM (5 minutes ago)
  
else if (name.equals("MD_Metadata")) {
            try {
               
                XPath xpath = XPath.newInstance("gmd:fileIdentifier/gco:CharacterString");
                //add gco namespac by cisc.gmu.edu
                xpath.addNamespace("gco", "http://www.isotc211.org/2005/gco&quot;\);
                //System.out.println(Xml.getString(record));
                Element identif = (Element) xpath.selectSingleNode(record);
.....

Cheers,

Kai
Joint Center for Intelligent Spatial Computing
703-395-2337

----- Original Message -----
From: Andy Horseman <anho@anonymised.com>
Date: Wednesday, October 19, 2011 10:55 am
Subject: [GeoNetwork-users] Harvesting EUMETSAT data

Hi,

I'm trying to harvest the data provided by EUMETSAT's data center.
Theirdocumentation states that their via their product navigator is
OGC CSW
2.0.2 compliant and the capabilities can be harvested via
http://navigator.eumetsat.int/soapServices/CSWStartup?service=CSW&version=2.0.2&request=GetCapabilities

I've set this up and the harvest process seems to run with no errors,
but it doesn't get any records. Trying the URL in a browser seems to
return the appropriate XML. Has anybody got this to work, I don't know
what to try next.

--

Andy

--------------------------------------------------------------------
------------
Plymouth Marine Laboratory

Registered Office:
Prospect Place
The Hoe
Plymouth PL1 3DH

Website: www.pml.ac.uk
Click here for PML Annual Review
Registered Charity No. 1091222
PML is a company limited by guarantee
registered in England & Wales
company number 4178503

Please think before you print

--------------------------------------------------------------------
------------
This e-mail, its content and any file attachments are confidential.

If you have received this e-mail in error please do not copy,
disclose it to any third party or use the contents or attachments
in any way. Please notify the sender by replying to this e-mail or
e-mail forinfo@anonymised.com and then delete the email without making
any copies or using it in any other way.

The content of this message may contain personal views which are
not the views of Plymouth Marine Laboratory unless specifically
stated.
You are reminded that e-mail communications are not secure and may
contain viruses. Plymouth Marine Laboratory accepts no liability
for any loss or damage which may be caused by viruses.
--------------------------------------------------------------------
------------

--------------------------------------------------------------------
----------
All the data continuously generated in your IT infrastructure
contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and
makessense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2d-oct
_______________________________________________
GeoNetwork-users mailing list
GeoNetwork-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/geonetwork-users
GeoNetwork OpenSource is maintained at
http://sourceforge.net/projects/geonetwork

--------------------------------------------------------------------------------
Plymouth Marine Laboratory

Registered Office:
Prospect Place
The Hoe
Plymouth PL1 3DH

Website: www.pml.ac.uk
Click here for PML Annual Review
Registered Charity No. 1091222
PML is a company limited by guarantee
registered in England & Wales
company number 4178503

Please think before you print

--------------------------------------------------------------------------------
This e-mail, its content and any file attachments are confidential.

If you have received this e-mail in error please do not copy, disclose it to any third party or use the contents or attachments in any way. Please notify the sender by replying to this e-mail or e-mail forinfo@anonymised.com and then delete the email without making any copies or using it in any other way.

The content of this message may contain personal views which are not the views of Plymouth Marine Laboratory unless specifically stated.

You are reminded that e-mail communications are not secure and may contain viruses. Plymouth Marine Laboratory accepts no liability for any loss or damage which may be caused by viruses.
--------------------------------------------------------------------------------