[SAC] [OSGeo] #2705: Download rates from grass.osgeo.org extremely low

#2705: Download rates from grass.osgeo.org extremely low
Reporter: neteler | Owner: sac@…
     Type: defect | Status: new
Priority: normal | Milestone: Unplanned
Component: Systems Admin | Keywords:
I am trying to download a sample data package from grass.osgeo.org, which
takes "forever": about 80 KB/s (!)

**USA (Portland) --> Germany**

--2022-01-24 16:18:42--
Resolving grass.osgeo.org (grass.osgeo.org)...
Connecting to grass.osgeo.org (grass.osgeo.org)||:443...
HTTP request sent, awaiting response... 200 OK
Length: 166928373 (159M) [application/x-gzip]
Saving to: ‘nc_spm_full_v2alpha2.tar.gz’

nc_spm_full_v2alpha2.tar.gz 0%[ ] 551.71K 79.6KB/s eta 34m 4s

For comparison, I get about 6 MB/s from the mirror site in South Africa:

**South-Africa --> Germany**

mneteler@caddy: ~/tmp$ wget
--2022-01-24 16:28:49--
Resolving grass.mirror.ac.za (grass.mirror.ac.za)... 2001:4200:fffc::103,
Connecting to grass.mirror.ac.za (grass.mirror.ac.za)|2001:4200:fffc::103|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 166928373 (159M) [application/octet-stream]
Saving to: ‘nc_spm_full_v2alpha2.tar.gz.1’

nc_spm_full_v2alpha2.tar.gz.1 21%[===========> ] 34.62M 6.44MB/s eta 46s

Is there anything throttling the connectivity on the `grasslxd` container?
Esp. versus Europe?
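
To put the two wget readings side by side, here is a rough back-of-the-envelope sketch. It assumes a constant transfer rate and that wget's K and M suffixes are 1024-based; the function names are made up for illustration and are not part of any tool mentioned here.

```python
# Rough transfer-time estimate for the 159M sample data package.
# Assumption: constant rate, and wget's "K"/"M" are 1024-based units.

FILE_SIZE_BYTES = 166928373  # "Length" reported by wget above


def transfer_seconds(size_bytes: int, rate_bytes_per_s: float) -> float:
    """Time needed to move size_bytes at a constant rate."""
    if rate_bytes_per_s <= 0:
        raise ValueError("rate must be positive")
    return size_bytes / rate_bytes_per_s


def fmt_duration(seconds: float) -> str:
    """Format a duration in the 'Xm Ys' style that wget uses for its ETA."""
    minutes, secs = divmod(int(round(seconds)), 60)
    return f"{minutes}m {secs}s"


# Rates taken from the two wget progress lines above.
slow = transfer_seconds(FILE_SIZE_BYTES, 79.6 * 1024)       # grass.osgeo.org
fast = transfer_seconds(FILE_SIZE_BYTES, 6.44 * 1024 ** 2)  # grass.mirror.ac.za

print("grass.osgeo.org   :", fmt_duration(slow))
print("grass.mirror.ac.za:", fmt_duration(fast))
print(f"mirror is roughly {slow / fast:.0f}x faster")
```

The estimate covers the whole file, so it will not match wget's ETA exactly, since wget only counts the remaining bytes and uses its own rate averaging.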

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705>
OSGeo <https://osgeo.org/>
OSGeo committee and general foundation issue tracker.

Comment (by robe):

No, no throttling. Checking now to see how it is from here.

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:1>

Comment (by robe):

Confirmed, pretty slow from Boston too.

nc_spm_full_v2alpha2.tar.gz 4%[==> ] 7.58M 287KB/s eta 10m 3s

download.osgeo.org is slow as well, so this looks like a general issue
with osgeo7.

Going to check the other servers.

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:2>

Comment (by robe):

Does seem specific to osgeo7.

I compared a clone from osgeo4:

git clone https://dev.git.osgeo.org/gitea/postgis/postgis.git postgis-test

100% (106594/106594), 53.91 MiB | 15.30 MiB/s, done.

vs. osgeo7

remote: Total 109688 (delta 86395), reused 106890 (delta 83727)
Receiving objects: 100% (109688/109688), 56.17 MiB | 520.00 KiB/s, done.

osgeo7 is due for a reboot so I'll do that later today and will
investigate further if something specific is eating the bandwidth.
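
For a quick sense of the gap between the two clones, here is a small sketch that pulls the transfer rate out of the git progress lines quoted above and compares them. The helper name and the regular expression are illustrative assumptions, not anything git itself provides.

```python
import re

# Binary unit multipliers as printed by git ("KiB/s", "MiB/s", ...).
_UNITS = {"B": 1, "KiB": 1024, "MiB": 1024 ** 2, "GiB": 1024 ** 3}
# Matches the rate after the "|" separator, e.g. "| 15.30 MiB/s".
_RATE_RE = re.compile(r"\|\s*([\d.]+)\s*(B|KiB|MiB|GiB)/s")


def clone_rate(line: str) -> float:
    """Return the transfer rate in bytes per second from a git progress line."""
    match = _RATE_RE.search(line)
    if match is None:
        raise ValueError(f"no transfer rate found in: {line!r}")
    value, unit = match.groups()
    return float(value) * _UNITS[unit]


# The two result lines quoted in this comment.
osgeo4 = clone_rate("100% (106594/106594), 53.91 MiB | 15.30 MiB/s, done.")
osgeo7 = clone_rate(
    "Receiving objects: 100% (109688/109688), 56.17 MiB | 520.00 KiB/s, done."
)

print(f"osgeo4 is about {osgeo4 / osgeo7:.0f}x faster than osgeo7")
```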

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:3>

Comment (by robe):

I installed vnstat on the different servers to get a sense of how much
traffic each gets.
It's still building up stats, but already I see osgeo7 does get 10 times
more traffic than osgeo3 (and osgeo4 gets very little comparatively).

I think there is a limit per server just from the shared Ethernet card
alone, but there might also be a limit set by OSUOSL. I will ask OSUOSL
about it.

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:4>

Comment (by neteler):

Thanks, @robe, for inspecting this. It is an issue I have observed
multiple times over many months (I only wrote this ticket today).

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:5>

Comment (by neteler):

Right now this takes "forever" (about 80 KB/s):

wget http://download.osgeo.org/gdal/3.4.1/gdal-3.4.1.tar.gz
gdal-3.4.1.tar.gz 63%[===========================> ] 12.11M

(download server --> Bonn, Germany)

Is there anything that could be done about it? Thanks.

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:6>

Comment (by robe):


Sorry about that. I'm planning to round-robin download.osgeo.org this
month (across osgeo8 and osgeo4) so that the load is split across servers,
but I'm still testing some things before I do. I know, for example, that
osgeo4 is pretty fast, so the problem is just the terabytes of traffic
osgeo7 is handling.

One change I have to do before then is make sure everyone uploads to
upload.osgeo.org instead of download.osgeo.org. I'll send out a note in a
week or so about that.
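
As an illustration of what round-robin means for the load, here is a minimal sketch of strict rotation across the two hosts named above. The comment does not say whether the rotation would happen in DNS, in nginx, or elsewhere, so this shows only the idea and not the planned setup; the function name is hypothetical.

```python
from collections import Counter
from itertools import cycle
from typing import Iterable, Iterator

# Hosts named in the comment above; the selection mechanism is assumed.
BACKENDS = ["osgeo4", "osgeo8"]


def round_robin(backends: Iterable[str]) -> Iterator[str]:
    """Yield the backends in strict rotation, forever."""
    return cycle(backends)


# Hand out 1000 simulated requests and count where each one lands.
picker = round_robin(BACKENDS)
assignments = Counter(next(picker) for _ in range(1000))

for host, count in sorted(assignments.items()):
    print(f"{host}: {count} requests")
```

Strict rotation splits the request count evenly, but not necessarily the bytes served, since individual downloads differ a lot in size.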

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:7>

Comment (by robe):


I discovered I can set up a light nginx proxy on the other servers without
having to copy over all the data, since the speed between servers is fast.
I tried downloading:

wget http://download-cache.osgeo.org/gdal/3.4.1/gdal-3.4.1.tar.gz

and it was about 11 MB/s.

This is just a temporary name.

My plan is to balance the traffic on download.osgeo.org across the servers
and then eventually have some CDNs too.

I can't switch download.osgeo.org yet, since I think a lot of the rsync
jobs are set to use download.osgeo.org and many people use that name for
uploading. I need everyone to use upload.osgeo.org for uploading instead.
The same can be done for other sites like live.osgeo.org and
grass.osgeo.org, as long as you aren't using that name for rsync; if you
are, rsync would need to use a different name.
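
To illustrate why a proxy avoids copying all the data up front, here is a minimal sketch of a pull-through cache: a file is fetched from the origin only the first time it is requested, then served locally. This assumes the proxy caches responses (the download-cache name suggests so, but the comment does not say). The class and the fake origin function are hypothetical stand-ins, not the nginx configuration.

```python
from typing import Callable, Dict


class PullThroughCache:
    """Serve files locally, fetching from the origin only on a cache miss."""

    def __init__(self, fetch_from_origin: Callable[[str], bytes]) -> None:
        self._fetch = fetch_from_origin
        self._store: Dict[str, bytes] = {}
        self.hits = 0
        self.misses = 0

    def get(self, path: str) -> bytes:
        if path in self._store:
            self.hits += 1
            return self._store[path]
        # Miss: pull the file over the fast server-to-server link, keep a copy.
        self.misses += 1
        body = self._fetch(path)
        self._store[path] = body
        return body


def fake_origin(path: str) -> bytes:
    """Hypothetical stand-in for the origin server holding the data."""
    return f"contents of {path}".encode()


cache = PullThroughCache(fake_origin)
first = cache.get("/gdal/3.4.1/gdal-3.4.1.tar.gz")   # fetched from origin
second = cache.get("/gdal/3.4.1/gdal-3.4.1.tar.gz")  # served from the cache
print(f"hits={cache.hits} misses={cache.misses}")
```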

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:8>

Comment (by robe):

Okay, I added osgeo9 as a backup for download.osgeo.org, so it may not be
as fast anymore if you test. I'm monitoring it very closely and will also
investigate whether we need to kill some traffic coming to
download.osgeo.org.

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:9>

Comment (by neteler):

What's not clear to me: my report refers to grass.osgeo.org (=
osgeo7.osgeo.org), isn't that a different box than download.osgeo.org?

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:10>

Comment (by robe):

They are both on the same host, osgeo7, but in different containers. They
use the same IP.

Ticket URL: <https://trac.osgeo.org/osgeo/ticket/2705#comment:11>