Scheduled Maintenance: We are aware of an issue with Google, AOL, and Yahoo services as email providers which are blocking new registrations. We are trying to fix the issue and we have several internal and external support tickets in process to resolve the issue. Please see: viewtopic.php?t=158230

 

 

 

Download a website with "wget".

If none of the specific sub-forums seem right for your thread, ask here.
Post Reply
Message
Author
hack3rcon
Posts: 746
Joined: 2015-02-16 09:54
Has thanked: 48 times

Download a website with "wget".

#1 Post by hack3rcon »

Hello,
I used below command to download a website completely but some parts like images on the "index.html" not work:

Code: Select all

# wget -m https://ctf101.org/
Any idea?

Thank you.

kedaha
Posts: 3521
Joined: 2008-05-24 12:26
Has thanked: 33 times
Been thanked: 77 times

Re: Download a website with "wget".

#2 Post by kedaha »

hack3rcon wrote:

Code: Select all

# wget -m https://ctf101.org/
Any idea?
Hi,
Not really :wink: but I think, for a start, it should be downloaded as normal user, not as root; otherwise there'll be permissions issues.
DebianStable

Code: Select all

$ vrms

No non-free or contrib packages installed on debian!  rms would be proud.

andreathome
Posts: 9
Joined: 2019-08-25 08:06

Re: Download a website with "wget".

#3 Post by andreathome »

In the past there were quite good tools that could download quite a big part of htm or html based websites.
Nowadays a lot of website are database driving, using CMS front tools.
In my humble opinion it is not possible to just simply download such CMS based websites, certainly not with simple tools.

hack3rcon
Posts: 746
Joined: 2015-02-16 09:54
Has thanked: 48 times

Re: Download a website with "wget".

#4 Post by hack3rcon »

andreathome wrote:In the past there were quite good tools that could download quite a big part of htm or html based websites.
Nowadays a lot of website are database driving, using CMS front tools.
In my humble opinion it is not possible to just simply download such CMS based websites, certainly not with simple tools.
Any tools?

User avatar
oswaldkelso
df -h | grep > 20TiB
df -h | grep > 20TiB
Posts: 1495
Joined: 2005-07-26 23:20
Location: UK
Has thanked: 1 time
Been thanked: 60 times

Re: Download a website with "wget".

#5 Post by oswaldkelso »

Free Software Matters
Ash init durbatulûk, ash init gimbatul,
Ash init thrakatulûk agh burzum-ishi krimpatul.
My oldest used PC: 1999 imac 333Mhz 256MB PPC abandoned by Debian

andreathome
Posts: 9
Joined: 2019-08-25 08:06

Re: Download a website with "wget".

#6 Post by andreathome »

Also in this link:
HTTrack is an offline browser utility, allowing you to download a World Wide website from the Internet to a local directory, building recursively all directories, getting html, images, and other files from the server to your computer.
So no word there on CMS website which work completely different, with databases, sometimes even located elsewhere, so I think most of current websites cannot be downloaded with this tool, only fully classical htm/html based types.

reinob
Posts: 1196
Joined: 2014-06-30 11:42
Has thanked: 99 times
Been thanked: 47 times

Re: Download a website with "wget".

#7 Post by reinob »

hack3rcon wrote:Hello,
I used below command to download a website completely but some parts like images on the "index.html" not work:

Code: Select all

# wget -m https://ctf101.org/
Any idea?

Thank you.
I just tested with wget (check here: https://www.guyrutenberg.com/2014/05/02 ... sing-wget/) and it worked quite nicely. There may be some elements that are still downloaded (such as web fonts) but a quick test with firefox in offline mode shows the page just fine.

(You really really should read the manual of the tools you think are useful, as well as improve your google-fu.)

andre@home
Posts: 398
Joined: 2011-10-02 08:00

Re: Download a website with "wget".

#8 Post by andre@home »

For CMS based web sites as Joomla you need to tweak it to get a reasonable result:
https://vaasa.hacklab.fi/2013/11/28/how ... with-wget/
Needs quite some knowledge on how to convert it into the right way, how to apply the tweaks..

Post Reply