"Lawrence D'Oliveiro" <ldo@nz.invalid> wrote in message news:vbimad$1j26j$6@dont-email.me...
Don't you wonder why people insist on returning "403
Forbidden" for those using a command-line tool like wget?
Most SEO tools and other
useless/criminal scrapers like to fake their identification, and WGet is
a favorite for that task. There are many owners that block that and many other abused user-agents.
(The Ada-Auth.org blocks about 20 user-agents, but not WGet.)
On Wed, 11 Sep 2024 23:57:33 -0500, Randy Brukardt wrote:
"Lawrence D'Oliveiro" <ldo@nz.invalid> wrote in message news:vbimad$1j26j$6@dont-email.me...
Don't you wonder why people insist on returning "403
Forbidden" for those using a command-line tool like wget?
Most SEO tools and other
useless/criminal scrapers like to fake their identification, and WGet is
a favorite for that task. There are many owners that block that and many other abused user-agents.
That doesn't make any sense, because anybody who knows how to use wget
would know about its "--user-agent" option. So if they really were using
wget to conduct their site abuse, you wouldn't know, and blocking wget's default user-agent setting wouldn't help.
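
To make the mechanics concrete, here is a minimal sketch, assuming a hypothetical server-side filter that rejects requests by User-Agent prefix. The function name, the blocked prefix, and the example strings are illustrative assumptions, not any site's actual configuration. The point it shows is that the header is self-reported, so the filter only sees whatever the client chooses to send.

```python
# Hypothetical filter: the prefix list is an assumption for illustration,
# not taken from any real site's block list.
BLOCKED_PREFIXES = ("wget/",)


def is_blocked(user_agent: str) -> bool:
    """Return True if the self-reported User-Agent starts with a blocked prefix."""
    return user_agent.lower().startswith(BLOCKED_PREFIXES)


# The general shape of wget's default header; the version number is illustrative.
default_wget = "Wget/1.21.4"

# Any string a client chooses to send instead, e.g. via wget's --user-agent option.
overridden = "Mozilla/5.0 (X11; Linux x86_64)"

print(is_blocked(default_wget))  # True: the default string matches the prefix
print(is_blocked(overridden))    # False: same tool, different claimed identity
```

The same check cuts both ways: a client that is not wget but sends a "Wget/..." string would be rejected by this filter as well.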
Wrong.
Why is this wrong?
I run into sites all the time that block the wget user agent, but that I
can retrieve with curl.
On Thu, 12 Sep 2024 19:16:50 -0700, Paul Rubin wrote:
I run into sites all the time that block the wget user agent, but that I
can retrieve with curl.
And I run into sites all the time that block the default wget user agent,
but that I can retrieve with wget.
"Lawrence D'Oliveiro" <ldo@nz.invalid> wrote in message news:vc091i$ljiq$2@dont-email.me...
On Thu, 12 Sep 2024 19:16:50 -0700, Paul Rubin wrote:
I run into sites all the time that block the wget user agent, but that
I can retrieve with curl.
And I run into sites all the time that block the default wget user
agent, but that I can retrieve with wget.
You're confused. The attackers aren't using Wget, but they are
*claiming* to be WGet.
On Sat, 14 Sep 2024 01:27:22 -0500, Randy Brukardt wrote:
You're confused. The attackers aren't using Wget, but they are
*claiming* to be WGet.
But that long list of user agents being blocked that you previously
mentioned did not include wget.