[OSGeo] #3093: Ask web crawlers to not generate Gitea archives

#3093: Ask web crawlers to not generate Gitea archives
----------------------------+----------------------------------------
Reporter: strk | Owner: strk
     Type: task | Status: new
Priority: normal | Milestone: Sysadmin Contract 2024-III
Component: SysAdmin/Gitea | Keywords:
----------------------------+----------------------------------------
To prevent filling up disk.
Example robots.txt for this:
https://gitea.com/robots.txt

Best done via Ansible (#3084)
--
Ticket URL: <https://trac.osgeo.org/osgeo/ticket/3093&gt;
OSGeo <Gter - OSGeo;
OSGeo committee and general foundation issue tracker.

#3093: Ask web crawlers to not generate Gitea archives
----------------------------+-----------------------------------------
Reporter: strk | Owner: strk
     Type: task | Status: new
Priority: normal | Milestone: Sysadmin Contract 2024-III
Component: SysAdmin/Gitea | Resolution:
Keywords: |
----------------------------+-----------------------------------------
Comment (by robe):

You can test out the submodules :slight_smile: cause ultimately this should go in
osgeo7 nginx config for gitea which is stored here -
https://git.osgeo.org/gitea/sac/osgeo7-nginx/src/branch/main/etc/nginx
/sites-available (both git.osgeo.org and gitea.osgeo.org)

oh we should delete that gitea.osgeo.org.save, might be my fault for that
--
Ticket URL: <#3093 (Ask web crawlers to not generate Gitea archives) – OSGeo;
OSGeo <Gter - OSGeo;
OSGeo committee and general foundation issue tracker.