Pages in topic:   [1 2] >
How to copy all terms from a personal glossary?
Thread poster: Mooyae Choi
Mooyae Choi
Mooyae Choi  Identity Verified
China
Local time: 18:55
Member (2010)
English to Korean
+ ...
Nov 18, 2010

Hello
I wanna copy all words of my fields from the terminology colum personal glossary.
Who can tell me what should I do?I need it badly.
Thanks


 
Natalie
Natalie  Identity Verified
Poland
Local time: 11:55
Member (2002)
English to Russian
+ ...

MODERATOR
SITE LOCALIZER
More information, please! Nov 19, 2010

What exactly are you trying to copy, where, in which program, with what purpose?

Please try to describe your problem clearly, otherwise nobody will be able to help you.



[Edited at 2010-11-19 00:49 GMT]


 
Mooyae Choi
Mooyae Choi  Identity Verified
China
Local time: 18:55
Member (2010)
English to Korean
+ ...
TOPIC STARTER
Thanks Nov 19, 2010

Thank for you response.
I want copy the glossaries in the (other terminology resourceds) in the Terminology column. There so many words I need badly.Now do you understand me now?
So how can I copy or import those glossaries?
Thanks


 
Mooyae Choi
Mooyae Choi  Identity Verified
China
Local time: 18:55
Member (2010)
English to Korean
+ ...
TOPIC STARTER
url Nov 19, 2010

http://www.proz.com/?sp=mt&eid_s=80987&glossary=14606
I want to copy those words in the above url.
Is it possible?
Thanks


 
Tomás Cano Binder, BA, CT
Tomás Cano Binder, BA, CT  Identity Verified
Spain
Local time: 11:55
Member (2005)
English to Spanish
+ ...
We have to insist!! Nov 19, 2010

What software are you talking about!?

We cannot help you if you don't tell us. There are at least 20 different CAT tools out there!


 
Philippe Etienne
Philippe Etienne  Identity Verified
Spain
Local time: 11:55
Member
English to French
He wants to copy a glossary hosted on proz.com Nov 19, 2010

But I don't think you can.
You can search it but not copy the whole list.

The alternative would be to ask politely the owner of the glossary if s/he would be ready to share his/her assets.

The question shouldn't be in the CAT tools help section, rather something related to proz.com itself/resources/terminology/what-not.
There is nothing about CAT tools in your question if I understood it well.

Philippe


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 11:55
Member (2006)
English to Afrikaans
+ ...
Not directly, unfortunately Nov 19, 2010

kenneth-cui wrote:
http://www.proz.com/?sp=mt&eid_s=80987&glossary=14606
I want to copy those words in the above url.


Unfortunately, ProZ.com does not seem to provide a glossary export feature. This means that you'd have to visit each glossary page and copy the terms manually in some or other way.

The issue is further compounded by the Korean characters, which in the HTML code are entitised (i.e. Proz.com's HTML pages do not have the actual Korean characters in it, even though this is technically possible).

One option is to save each glossary page and open it in MS Word or OOo Writer and then select the table and copy it to another program, such as Excel, but that is going to take a very long time. MS Word takes a very long time to open just one of these glossary pages. Ooo Writer is quicker when opening a glossary page, but OOo Writer's table copying feature is far more primitive too.

I agree with Tomás and Natalie that we need to know which CAT tool you're using (if any).


 
Mooyae Choi
Mooyae Choi  Identity Verified
China
Local time: 18:55
Member (2010)
English to Korean
+ ...
TOPIC STARTER
Thanks Nov 19, 2010

Thank for all of your responses.
I have to do it manually.
Have a good days


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 11:55
Member (2006)
English to Afrikaans
+ ...
Doing it manually Nov 19, 2010

kenneth-cui wrote:
Thank for all of your responses.
I have to do it manually.


I've tried a number of html2rtf converters, and I have found that this one:
http://www.sautin.com/download/rtftohtml.zip
...gives the best results (if you deselect the "tables" option).

The free version only converts 30 000 characters, so you won't get much of a result with the free version unless you edit the HTML page by removing superfluous top-matter (i.e. everything before "Glossary name") and optionally bottom-matter (i.e. everything before {strong}1{/strong}).

With a little tweaking (okay, more than a little, perhaps), you can get this:
http://vertaal.org/tempfile/glossaryconversion.zip


 
Natalie
Natalie  Identity Verified
Poland
Local time: 11:55
Member (2002)
English to Russian
+ ...

MODERATOR
SITE LOCALIZER
Moving the thread Nov 19, 2010

to ProZ.com technical support forum.

 
FarkasAndras
FarkasAndras  Identity Verified
Local time: 11:55
English to Hungarian
+ ...
wget Nov 19, 2010

Samuel Murray wrote:

kenneth-cui wrote:
http://www.proz.com/?sp=mt&eid_s=80987&glossary=14606
I want to copy those words in the above url.


Unfortunately, ProZ.com does not seem to provide a glossary export feature. This means that you'd have to visit each glossary page and copy the terms manually in some or other way.


Not necessarily, automating this with wget is a trivial task (urls are rule-based). Then one could merge the html files, e.g. with copy *.html all.html in the windows console, then convert the html to text. Or, probably even better, extract the relevant bits from the raw html with a regex. Of course these steps are not that trivial...


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 11:55
Member (2006)
English to Afrikaans
+ ...
@Farkas... Nov 19, 2010

FarkasAndras wrote:
Samuel Murray wrote:
This means that you'd have to visit each glossary page and copy the terms manually in some or other way.

Not necessarily, automating this with wget is a trivial task (urls are rule-based).


Yes, of course, it doesn't matter how you get the HTML pages... whether you do it manually or using a script or wget or whatnot, but you'll end up with a bunch of HTML files that need to be converted and merged (or merged and converted... depending on which is better).

...then convert the html to text.


Do you have a free HTML2TXT converter that will successfully convert the OP's pages? Most of the ones I've tried does not convert entities to Unicode characters. I guess one could build an entity2unicode routine into the conversion procedure, but that would be one more step.

Or, probably even better, extract the relevant bits from the raw html with a regex. Of course these steps are not that trivial...


Yes, there are many ways to skin a cat.



[Edited at 2010-11-19 15:57 GMT]


 
FarkasAndras
FarkasAndras  Identity Verified
Local time: 11:55
English to Hungarian
+ ...
Sort of, yes Nov 19, 2010

Samuel Murray wrote:

Do you have a free HTML2TXT converter that will successfully convert the OP's pages? Most of the ones I've tried does not convert entities to Unicode characters. I guess one could build an entity2unicode routine into the conversion procedure, but that would be one more step.


I don't know how they work on these particular pages, but I have a good HTML->txt tool, I needed one for the aligner. Eventually, I'll polish it up drop one of it in a standalone windows .exe and publish it on sourceforge.
If you need stuff like this and you know a bit of perl, the HTML::Strip and HTML::Parser modules are what I use (HTML::Entities within HTML::Parser, to be precise). HTML::Entities does the HTML entity conversion into utf-8. Of course I have no idea about how well it handles Korean, but it should do the job in principle. Also, you could always graft a supplementary converter onto that quite easily - I already have, in fact. It's just a matter of cannibalizing the code from the aligner.


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 11:55
Member (2006)
English to Afrikaans
+ ...
My solution, for the OP Nov 19, 2010

Hey Kenneth

If you have a real operating system (i.e. Windows), try this:
http://vertaal.org/tempfile/prozglossmerge.zip (there is no virus)

Steps:
1. Download the glossary HTML files
2. Put this little program in the same folder as those files
3. Run the program
4. Open the resulting HTML file in your word processor, copy the middle c
... See more
Hey Kenneth

If you have a real operating system (i.e. Windows), try this:
http://vertaal.org/tempfile/prozglossmerge.zip (there is no virus)

Steps:
1. Download the glossary HTML files
2. Put this little program in the same folder as those files
3. Run the program
4. Open the resulting HTML file in your word processor, copy the middle columns, and paste in a spreadsheet.
or...
4. Run the resulting HTMl file through an HTML2TXT program.



[Edited at 2010-11-19 13:56 GMT]
Collapse


 
Mooyae Choi
Mooyae Choi  Identity Verified
China
Local time: 18:55
Member (2010)
English to Korean
+ ...
TOPIC STARTER
Thank you very much Nov 19, 2010

Dear Samuel
You do great help to me.
Thank you so much.
And have a good day.


 
Pages in topic:   [1 2] >


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

How to copy all terms from a personal glossary?






Trados Studio 2022 Freelance
The leading translation software used by over 270,000 translators.

Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop and cloud solution, empowering you to work in the most efficient and cost-effective way.

More info »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »