Page MenuHomeDevCentral

Analyze Wired
Needs ReviewPublic

Authored by Thibaut120094 on Jun 17 2017, 11:59.
Tags
None
Referenced Files
F3762760: D998.diff
Thu, Nov 21, 14:50
Unknown Object (File)
Wed, Nov 13, 00:22
Unknown Object (File)
Oct 22 2024, 00:55
Unknown Object (File)
Oct 21 2024, 06:01
Unknown Object (File)
Oct 21 2024, 05:29
Unknown Object (File)
Oct 21 2024, 04:52
Unknown Object (File)
Oct 16 2024, 23:07
Unknown Object (File)
Oct 7 2024, 06:33
Subscribers
None

Diff Detail

Repository
rSTG Source templates generator
Lint
No Lint Coverage
Unit
No Test Coverage
Branch
site/wired
Build Status
Buildable 1555
Build 1803: arc lint + arc unit

Event Timeline

However, it seems that some pages are compressed with gzip and STG won't be able to parse them without decompressing them first.

thib@debian:~$ wget "https://www.wired.com/story/amazon-whole-foods-acquisition-grocery-shopping/" -O output.html    --2017-06-17 12:13:58--  https://www.wired.com/story/amazon-whole-foods-acquisition-grocery-shopping/
Resolving www.wired.com (www.wired.com)... 151.101.1.63, 151.101.129.63, 151.101.65.63, ...
Connecting to www.wired.com (www.wired.com)|151.101.1.63|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 125202 (122K) [text/html]
Saving to: ‘output.html’

output.html                   100%[================================================>] 122.27K  --.-KB/s   in 0.02s

2017-06-17 12:13:58 (6.20 MB/s) - ‘output.html’ saved [125202/125202]

thib@debian:~$ file output.html
output.html: gzip compressed data, from Unix