People save web pages to ensure they can retrieve information later without having to load it on the internet. It also is a way of retrieving a web page just in case the original web site has an outage or goes offline for whatever reason.
There are two basic ways of saving web pages, that being via the browser or "printing it" to a PDF.
Via the browser
The browser that has the absolute best web page save feature is Internet Explorer 8, due to the fact it can save entire web pages as a "Web Archive." When you click File/Save As (if you don’t see that in your IE 8, press ALT on your keyboard to bring up that menu,) you’ll see it as a save option:
When you choose to save it will "crunch" everything into a single file:
 
Why is this the best? Because it’s a single file that contains everything (and that’s why it’s labeled as an archive.) All the text, all the images and everything included. If you load it afterward, it looks exactly the way it was originally. It is to the best of my knowledge the only browser that does it right.
Other browsers, such as Firefox, save as "Web page, complete" and it’s nothing but a huge mess. An HTML file will be saved which is the web page, but a subfolder will also be created with all the images, JavaScript files, etc. You can literally get 20+ files out of a single web page save.
Love or hate IE 8, it rules the roost when it comes to web page archiving.
Drawbacks:
- Only one – it’s proprietary to IE 8. Otherwise it’s the best way to archive a web page.
Via PDF Creator
If you don’t use IE 8 and want a web to save web pages a single files that include images and so on, the best way to do this is to use PDF Creator to create PDF files. This is free software that will install a virtual print driver and can be used in your web browser of choice.
Once installed, go to any web page, load it, then click File/Print or press CTRL+P.
Choose PDF Creator from the window that appears:
 
..click OK.
The page will be crunched and made ready for PDF rendering:
 
You’ll see this:
 
Click the Save button at bottom right. You’ll be asked to name the file and where you want to save it to. Once done, the page is archived as a PDF.
Drawbacks:
- Many times the PDF creator will default to a serif font (Times New Roman) instead of the font seen on the original web page.
- Any links in the web page will not work in the PDF.
These drawbacks are usually acceptable being it’s the text you care about the most when it comes to a web page. Any images on the page will be embedded in the PDF; all text is searchable as well.
In addition, the PDF created even for very large web pages will be small in file size, suitable for sending in email if you want to send it off to a friend.
Via ScreenGrab
This is for Firefox only.
ScreenGrab is a FireFox plugin. It allows you to save a PNG or JPEG screen shot of any web page, but does so far better than ALT+PrintScreen. ScreenGrab will take an image of the entire page including the full length. The screen shot taken will look identical to what you see on-screen.
Drawbacks:
- Since the output file is an image, none of the text can be searched and links won’t work either.
- The default output file is a PNG. If the web page you save is very long, the file saved will be enormous.
- On very large web pages it can cause Firefox to freeze up when attempting to take a full screen shot, particularly on slower computers.
You can make the screen shot ScreenGrab takes to be smaller by purposely not using the browser maximized, because yes, ScreenGrab captures everything – including all the white space on the sides.
To use ScreenGrab, install the add-on, then on any web page, right-click and choose ScreenGrab:
 
"Complete Page/Frame" will save the entire page, length and all.
"Visible portion" only captures what the browser is displaying at that moment.
"Selection" allows you to select what you want captured.
"Window" acts like ALT+PrintScreen does.
Choosing to Save will save the file. Choosing to Copy will copy the image to the clipboard buffer where you can paste into another program such as an image editor, Word, etc.
0 comments