[SAC] [OSGeo] #173: mailman archives setup to discourage search engine crawlers?

#173: mailman archives setup to discourage search engine crawlers?
----------------------+-----------------------------------------------------
Reporter: warmerdam | Owner: sac@lists.osgeo.org
    Type: task | Status: new
Priority: normal | Component: SAC
Keywords: mailman |
----------------------+-----------------------------------------------------
I noticed that the OSGeo mailing list archives disallow indexing with
the following code bit:
{{{
      <META NAME="robots" CONTENT="noindex,follow">
}}}
at this URL: http://lists.osgeo.org/pipermail/gdal-dev/

Could you remove the "noindex" bit, or at least forward this on to the
owner of the archives in general so they can do so?

--
Ticket URL: <http://trac.osgeo.org/osgeo/ticket/173>
OSGeo <http://www.osgeo.org/>
OSGeo committee and general foundation issue tracker.

#173: mailman archives setup to discourage search engine crawlers?
------------------------+---------------------------------------------------
  Reporter: warmerdam | Owner: sac@lists.osgeo.org
      Type: task | Status: new
  Priority: normal | Component: SAC
Resolution: | Keywords: mailman
------------------------+---------------------------------------------------
Comment (by warmerdam):

I have reviewed the "Archiving Options" page for mailman and didn't notice
any obvious way of controlling this setting. Anyone else have ideas?
Does the <META> tag in question really tell search engines not to catalog
our archives?

--
Ticket URL: <http://trac.osgeo.org/osgeo/ticket/173#comment:1>

#173: mailman archives setup to discourage search engine crawlers?
------------------------+---------------------------------------------------
  Reporter: warmerdam | Owner: sac@lists.osgeo.org
      Type: task | Status: new
  Priority: normal | Component: SAC
Resolution: | Keywords: mailman
------------------------+---------------------------------------------------
Changes (by neteler):

* cc: neteler (added)

Comment:

Yes, noindex tells search engines not to index the page (please change
that...).

e.g.
http://www.robotstxt.org/meta.html

http://www.google.com/search?q=site:mail.python.org+mailman+meta+noindex

--
Ticket URL: <http://trac.osgeo.org/osgeo/ticket/173#comment:2>

#173: mailman archives setup to discourage search engine crawlers?
------------------------+---------------------------------------------------
  Reporter: warmerdam | Owner: sac@lists.osgeo.org
      Type: task | Status: new
  Priority: normal | Component: SAC
Resolution: | Keywords: mailman
------------------------+---------------------------------------------------
Comment (by jbirch):

Yes, but it's OK because the individual message pages say:

{{{
#!text/html
      <META NAME="robots" CONTENT="index,nofollow">
}}}

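Mailman has it right: the listing pages are marked "noindex,follow", so
crawlers follow their links without indexing them, and we only want the
engines to index the "meat", i.e. the individual messages.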
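
For anyone who wants to check a page themselves, here is a minimal sketch
(not part of Mailman; the class and function names are made up for
illustration) that reads the robots META tag out of a page's HTML and reports
whether a crawler honouring it may index the page and follow its links. It
assumes the usual convention that a missing directive means "allowed" and
that "none" is shorthand for "noindex,nofollow".

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <META NAME="robots" CONTENT="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling us.
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() != "robots":
            return
        content = attr.get("content") or ""
        self.directives.update(
            part.strip().lower() for part in content.split(",") if part.strip()
        )


def robots_policy(html):
    """Return whether the page may be indexed and its links followed.

    Assumption: absent directives default to allowed; "none" denies both.
    """
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    found = parser.directives
    return {
        "index": "noindex" not in found and "none" not in found,
        "follow": "nofollow" not in found and "none" not in found,
    }


# The two tags quoted in this ticket.
listing_page = '<META NAME="robots" CONTENT="noindex,follow">'
message_page = '<META NAME="robots" CONTENT="index,nofollow">'

print("listing:", robots_policy(listing_page))
print("message:", robots_policy(message_page))
```

Run against the two tags quoted in this ticket, the listing page comes out as
"do not index, do follow" and the message page as "index, do not follow".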

--
Ticket URL: <http://trac.osgeo.org/osgeo/ticket/173#comment:3>

#173: mailman archives setup to discourage search engine crawlers?
------------------------+---------------------------------------------------
  Reporter: warmerdam | Owner: sac@lists.osgeo.org
      Type: task | Status: closed
  Priority: normal | Component: SAC
Resolution: invalid | Keywords: mailman
------------------------+---------------------------------------------------
Changes (by warmerdam):

  * status: new => closed
  * resolution: => invalid

Comment:

Thanks Jason, I think I understand now. Closing...

--
Ticket URL: <http://trac.osgeo.org/osgeo/ticket/173#comment:4>