Page MenuHomeDevCentral

Analyze Wired
Needs ReviewPublic

Authored by Thibaut120094 on Jun 17 2017, 11:59.
Tags
None
Referenced Files
Unknown Object (File)
Tue, Dec 17, 13:46
Unknown Object (File)
Tue, Dec 17, 10:25
Unknown Object (File)
Mon, Dec 9, 16:05
Unknown Object (File)
Sun, Dec 8, 08:22
Unknown Object (File)
Thu, Dec 5, 08:17
Unknown Object (File)
Wed, Dec 4, 08:09
Unknown Object (File)
Wed, Dec 4, 08:09
Unknown Object (File)
Wed, Dec 4, 07:49
Subscribers
None

Diff Detail

Repository
rSTG Source templates generator
Lint
No Lint Coverage
Unit
No Test Coverage
Branch
site/wired
Build Status
Buildable 1555
Build 1803: arc lint + arc unit

Event Timeline

However, it seems that some pages are compressed with gzip and STG won't be able to parse them without decompressing them first.

thib@debian:~$ wget "https://www.wired.com/story/amazon-whole-foods-acquisition-grocery-shopping/" -O output.html    --2017-06-17 12:13:58--  https://www.wired.com/story/amazon-whole-foods-acquisition-grocery-shopping/
Resolving www.wired.com (www.wired.com)... 151.101.1.63, 151.101.129.63, 151.101.65.63, ...
Connecting to www.wired.com (www.wired.com)|151.101.1.63|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 125202 (122K) [text/html]
Saving to: ‘output.html’

output.html                   100%[================================================>] 122.27K  --.-KB/s   in 0.02s

2017-06-17 12:13:58 (6.20 MB/s) - ‘output.html’ saved [125202/125202]

thib@debian:~$ file output.html
output.html: gzip compressed data, from Unix