Automatic extraction of individual html pages from a Website
Thread poster: Noemi Carrera
Noemi Carrera Spain Local time: 07:38 Member (2003) English to Spanish
May 9, 2006
Hi everyone,
I need to translate a Website that consists of lots of html pages. The client has not provided us with these pages, just with the .doc files.
I would prefer to work in TagEditor because there is a lot of formatting and tables everywhere, and I was wondering whether there is any software that can automatically extract all the individual html pages from a Website.
Thank you very much in advance!
Best regards,
Noemí
Robert Tucker (X) United Kingdom Local time: 06:38 German to English + ...
wget
May 9, 2006
wget was originally written for Unix, but there are now Windows versions. You may want to search the net for a version you like the look of; I found this one:
There is other software for the task, but I still find wget the easiest to use even though it is a command-line tool.
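For anyone who has not used wget before, here is a rough sketch of what a recursive download might look like when driven from a small Python script. The URL, the output folder name and the depth of 5 are placeholders I made up, not anything from the original poster's project, and the script assumes wget is installed and on your PATH. The option names are standard wget ones, but older builds call `--adjust-extension` by the name `--html-extension`, so check `wget --help` for your version.

```python
import shutil
import subprocess


def build_wget_command(url, dest_dir="site_copy", depth=5):
    """Return the argument list for a recursive wget download of `url`.

    `dest_dir` and `depth` are illustrative defaults, not recommended values.
    """
    return [
        "wget",
        "--recursive",                      # follow links and fetch the linked pages
        f"--level={depth}",                 # limit how deep the recursion goes
        "--no-parent",                      # never climb above the starting directory
        "--page-requisites",                # also fetch images and stylesheets the pages need
        "--convert-links",                  # rewrite links so the local copy works offline
        "--adjust-extension",               # save HTML pages with an .html suffix
        f"--directory-prefix={dest_dir}",   # put everything under this folder
        url,
    ]


def mirror_site(url, dest_dir="site_copy", depth=5):
    """Run wget if it is installed and return its exit code."""
    if shutil.which("wget") is None:
        raise RuntimeError("wget was not found on PATH")
    # wget can exit with a non-zero code when a single page fails to download,
    # so the code is returned to the caller instead of raising an exception.
    result = subprocess.run(build_wget_command(url, dest_dir, depth))
    return result.returncode


if __name__ == "__main__":
    # Only print the command here; call mirror_site() to actually download.
    print(" ".join(build_wget_command("https://www.example.com/")))
```

Calling `mirror_site("https://www.example.com/")` should leave the individual html files in a folder named after the host inside `site_copy`, and those files can then be opened one by one in TagEditor.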
tlmurray (X) Local time: 01:38 English
Acrobat, others
May 9, 2006
Acrobat (Pro, at least) will dredge through an entire site and make a PDF of each page.
If you're fortunate to have a Mac, Webstractor (softchaos.com) pulls pages into a document that allows editing right there, sort of like viewing a page "in Word". There may be similar tools in Windows.
I noticed you said the client gave you the .doc files. Do you mean that the Web site is made from Word-to-Web, and you have the native docs? Because that sounds like you're home free for translating...
Maria Asis Spain Local time: 07:38 Member (2002) English to Spanish + ...