Update: SDL TmConvert (or any tool in the SDL App Store) requires users to have bought the license of the corresponding program. Hence this is not accessible for people not holding a license of SDL Trados. I then found these tools for converting SDLTM to TMX, for free:
Trados Studio Resource Converter: http://www.vannellen.com/fortranslators.php
(requires Java JRE 8)
Wordfast Converter: http://wordfast.net/zip/WfConverter.zip
(Required DLL: http://sqlite.org/download.html – file to download: sqlite-dll-win32-x86-*.zip)
Xbench can also convert SDLTM to TMX, however it is more complicated for this task.
I tried Wordfast Converter with SQLite DLL, however it didn’t work for my computer. I tried Trados Studio Resource Converter, with updating Jave Runtime Environment, it works fine.
Paul Filkin wrote about the tool SDL TmConvert that helps to manage the Translation Memories of SDL Studio, eg. converting .SDLTM to the exchangable format .TMX.
SDL TmConvert is sold at: http://appstore.sdl.com/app/sdltmconvert/228/#51222
A free version is limited to 50K segments.
Studio does have some simple features for helping you to maintain your Translation Memories, but there are things you might want to do that go beyond the capabilities of search and replace, creating and editing fields and attributes or exporting subsets or complete copies of your Translation Memories… let’s refer to these as TMs from here on in.
If you wanted to do these sorts of things to help you properly maintain your TMs or use them for other things then you would have to use tools that operate outside of Studio… but not necessarily outside the Studio platform. So I’m talking about theSDL OpenExchange of course. This platform and the capability that is available for anyone who has the knowledge to do their own programming is not provided by anyone else. It really is a well developed and supported tool in it’s own right. A good example of how this can be used is by another Hellenic, although this time a developer and not a dramatist Costas Nadalis has developed four applications for the OpenExchange and is one of the most prolific developers creating applications for others to benefit from:
All of these are really useful applications and all of them are applications that centre around the wisdom of maximising the use of your TMs. The one I want to focus on with this article is SDL TmConvert as this is the most recent application Costas has submitted and it allows you to do all of the things I mentioned at the start and more. The application itself looks like this (the control part of the screen anyway):
You can immediately see this is a very compact application filled with some excellent features, and you’ll also see there is some licensing information too. This application has a free version that allows you to work with TMs containing up to 50,000 Translation Units. To work with TMs greater than that you need the Pro version which is €35… but this is a bargain for the functionality we see.
I’m not going to run through everything this tool can do as there is an excellent help file that explains how to use the features and also what they may be useful for, but I will pick out a couple of things that are interesting for me.
First of all the ability to convert your TM to XLIFF. This is an option you need to be careful with if you’re working through your own TMs because once you do this and create a new TM with a QA’d XLIFF then you will lose all the context information that is held in the TM but not in an XLIFF… so information related to Context Match, TU information and custom fields. But the process of converting your TM to XLIFF files, adding to a project and QA’ing them is not insurmountable… you just need to be careful and Costas kindly provides good information in his help file explaining the process. In a nutshell this is what you’d do.
DO NOT UPDATE THE TM BY CONFIRMING THE TRANSLATION IN THE EDITOR
Another useful thing to do as you prepare the project could be to create a file containing frequently occurring segments:
This option, which you set when you prepare your project will create an SDLXLIFF file that contains segments, or in this case Translation Units, that are duplicated at least twice (I set this to 2). So I can then add this file back into my Project and immediately see whether I can improve this situation a little. So with this TM I had two segments that were duplicate source, and looking at the results I can see this second segment has four duplicate translations, each getting a penalty of 1% so in effect reducing my possibilities for a 100% match:
I could delete the ones I don’t want in the TM results window… quite a neat feature I think.
So an excellent way to QA TMs from others and for checking on the status of your own from time to time. Despite having 600K words in this relatively small TM I would hope that the vast majority would not need too much work, so this process is a very useful one.
The other thing I wanted to mention is related to preparing your TM for use elsewhere. So in creating AutoSuggest Dictionaries, or for training Machine Translation Engines. Neither of these things need to have all the tags and superfluous TU information that you get in an SDLTM. It’s all relevant for translating, but just adds unnecessary processing for other uses. This is where Costas added a really neat feature to this application that allows you to strip out all the superfluous information at the click of a button.
If I take as an example the same TM I have a TU in here that looks like this with quite a lot of tags, system fields and a couple of custom attributes:
If I export this to a TMX through Studio in the normal way I will get a TU that looks like this:
If I use the SDLTmConvert application set up to strip out all of the superfluous information I’m not interested in it looks like this… and took less than 5 seconds to do it (it did take 9 minutes to do the 1.9 million TUs TM I used this on… but the subsequent ASD generation with maximum extraction was possible where it failed before I did this):
That looks much cleaner, and is much more suitable for using to generate my AutoSuggest Dictionary or feed into mynew BeGlobal Trainer for my customised machine translation solution.
Costas really does epitomise what the OpenExchange is all about… providing people with the ability to create applications that can add value to the solutions you have purchased by helping them to work together more efficiently, and at the same time improve the ability to get more use from the resources you already have. I think all the tools he has developed so far have been good, but this one is really good!