Translation tools: April 2009

Thursday, April 30, 2009

MemoQ 3.5.15 - just some smallish notes

I'm not quite experienced with MemoQ. I used to test it a year ago, but didn't really perform too many jobs in it.

Anyway, after attending MemoQFest and hearing so many words about how great it is (and I did like it when I tested), I decided to eventually seriously test it. I was a kind of disappointed to get MemoQ4Free trial license which is apparently because I have this serial number for more than a year. And I didn't have a chance to contact MemoQ people asking for a more capable trial version. Will do.

So, I decided to test by translating MemoQ help files. But all I could do with this license was to translate onl one file because I could only have one file per project. There was no point to create several single-file projects as they are not allowed tro share a translation memory anyway. So, before I really start testing I'll have to eventually send my thanks and appreciations for the event to Kilgray.

However, I noticed a couple of interesting facts (drawbacks indeed as I always notice drawbacks). They may be of no importance for many users, but are sometimes annoying for me. And I don't know if they can be corrected by setting some options or not, I couldn't find ways to do that in 5 minutes, so I postponed searching.

So, those two facts are:

1. If a segment that is already in your TM is a part of another segment (in my case it was the name of a dialog box: the first segment contained the name itself, and some other segments referred to this name then) it automatically suggests you the translation. This is indeed a ver nice feature, not drawback. What is to my mind a drawback is that if ou select to insert this translation, and if it was included into tags in the segment being transalted, the tags disappear. So, you have to paste them from source text or cope and paste the translation manually.

2. This one also relates to tags. Differnet people have different work habits, and I prefer to have source text copied into the target, then select some number of words and replace them with the translation (just type it instead of the selected text). I select the text using Ctrl-Shift-arrows on the keyboard. And again, if there is a tag in the the text, and if I try to select the text before the tag, it includes the tag into the selection as well. So if I just retype, the tag is deleted. And if I want to preserve the tag, I need to press Shift-Left arrow twice to move to the end of the word preceding the text. This is something that really poisoned my life with MemoQ.

Friday, April 24, 2009

Is Web 2.0 really here?

I'm currently sitting at MemoQFest, Kilgray's first user conference, but this questions interests me for much longer, and especially at each conference I attend.

People are speaking about Web collaborations - that's OK and possible now even in Russia where some freelancers still use analog modems to connect to the Internet. But people are also speaking about submitting online requests for quotations etc. Nothing new indeed, translation portals exist for ages, and what people are talking about now is just tools that work behind the portal to automate workflows. But are our clients really that mature?

Well, some of them are. They use their own workflow systems which upload project files to their FTP servers and automatically generate e-mail notifications to our project managers. How can we connect it with our Web-based workflow management system? The answer I just heard was to have project managers enter this information into the portal form instead of the client. This indeed will work, but isn't it really ridiculous from the technology point of view to have project managers do something manually in order to have it then processed automatically while both steps can be done automatically?

Clients that are not that mature are often secure. They do care about their information, and they often have a list of software and procedures they can use and a list of those they are not allowed to use at the office. File transfer via a portal is often on such "black lists".

I'm not even going to touch the clients who are just tough and wouldn't want to change the manner of their work just because some technology requires it. Later on they can fall in love with translation portals, but it will take a year or more for them. Do we have just to drop such clients off?

I really wonder what other LSPs think about it. I agree that we can be strict when it comes to payment terms, pricing issues or other things like that because this is what our clients owe us. But as soon as it concerns the way we provide services to our clients, i.e. what we owe them - do we have to push the clients and make them accept our work style? I think, no. That's why I'm still looking for the translation management system that would do everything such systems offer now and will provide integration with customers systems and/or e-mail service.

Sunday, April 19, 2009

QA functionality in XBench 2.7

XBench 2.7 (build 0.183)

Supported checks. This tool officially does not support Unicode, and this is probably the main drawback of the application that eventually resulted in a rather high error level. For this reason, it does not support checking Arabic, Chinese, Farsi as well as Czech and Polish TTX files.
It does not have punctuation checks enabled by default; however, they are easy to enable via XBench rules which may significantly improve its error reporting in real life.
False positives. XBench reported corrupt characters for all non-Latin and non-Cyrillic characters.
Multilingual project support. Like many other QA tools, this one finds inconsistencies between translations into different languages and reports terminology errors in untranslated segments.
Conclusion. This tool is very new to the market, but probably on of the most promising ones to date. Its extensive file format support, additional functionality and extension capabilities together with the fact that the tool is currently free allow to suppose many companies, particularly small ones, may want to select it as their QA solution.

Thursday, April 16, 2009

QA functionality in QA Distiller 6.0.0

QA Distiller™ 6.0.0, build 188

Supported checks. This tool supports the widest number of possible checks which still may be extended using regular expressions. However, a serious drawback is that it does not check tags identity. The test file included a hyperlink which was intentionally changed in all translations; however, QA Distiller ignored it.
Additionally, we couldn’t find a way to check untranslatables in Distiller. There are two places where you can set a list of untranslatable items, and our idea was that the tool should make sure they are identical in source and target text. However, Distiller did not report missing untranslatable which existed in the test file.
Another weak point is number formatting check. Distiller only makes sure the number includes separators specified in parameters of the target language, but does not check the order of the separators. So, for example, it will consider 1,222.33 and 1.222,33 to be the same numbers with regard to number formatting.
Multilingual project support. In addition to reporting inconsistencies between different languages, Distiller also handles multilingual batches together with multilingual dictionaries in a strange way. For the first file it encounters, it tries to match all the glossary files in spite of the language indicated in the translated file and the glossary, which results in numerous “ignored terminology” errors. For the second target language, it tries to match it to all the glossaries until it finds the correct one. Then it perfectly matches the rest of the translated files with correct glossaries and doesn’t generate error messages.
Right-to-left language support. While this is the most comprehensive QA tool so far, it definitely lacks right-to-left languages support. Sentences in those languages are still aligned left-to-right, and if a segment ends with non-Arabic/Farsi/Hebrew words or digits, QA Distiller often handles the end of the segment incorrectly which results in a false error message. Additionally, it reports terminology errors in almost every segment because of incorrect RTL text handling. If you open the same file in MS Word and do a simple search for the glossary term you will be able to locate it easily while Distiller insists the term translation is missing.
It does not support Farsi by default, so we had to define a new language which resulted in reporting too many corrupt characters (480 occurrences).
Additional observations. Inability to change error severity may also be considered as a disadvantage (at least it makes the software less flexible and customisable).
Conclusion. At the moment, this is the most comprehensive, yet rather expensive standalone solution on the market.

Monday, April 13, 2009

QA functionality in ErrorSpy 4.0

ErrorSpy 4.0, build 001

Supported checks. The total set of supported checks is quite extensive with some specifics listed below. ErrorSpy includes presets for some languages, but they are sometimes incorrect (e.g. incorrect quotation marks for French). It does not support Chinese Traditional as well as right-to-left languages without additional customisation and does not support specifying more than one set of quotation marks in case of nesting. Although it allows to specify decimal and thousand separators to check number formatting, we failed to make it check it. In fact it only reported unmatched figures.
File import. The tool cannot check for skipped segments because it does not import skipped segments at all. This is not convenient if you need to locate any segments that were left untranslated.
False positives. ErrorSpy reported “space required after punctuation mark” errors even if the corresponding checkbox is deselected.
Right-to-left language support. No support by default; however, the tool allows to create new languages. For Arabic, it reports Latin characters to be punctuation marks although they were listed as valid characters in language configuration.
Additional observations. English user interface contains translation errors (for example, one of the checkboxes reads: “spaces that require a space before”).
Another drawback is that ErrorSpy sometimes corrupts the first letter of language names.
This tool does not remember the directory it recently worked with. It is quite inconvenient when you work in a non-default directory.
For some reason, more and more interface elements switch to German with each test run. The interface gets back to English after restart.
The tool crashed on each attempt to check Russian. This may be a problem of this particular installation, but may also be a problem of the whole release. It must be noted, however, that version 3 ran smoothly on the same computer.
Conclusion. Although the tool significantly improved compared to its version 3, the first build seems to be quite unstable. However, with such rapid progress, this tool is rather promising.

Sunday, April 12, 2009

QA functionality in Wordfast

Wordfast version 5.51t3

Supported checks. The amount of checks supported is rather limited, but may be extended using custom macros.
File import. Wordfast determines HTML files by <html> tag at the beginning, but not by the real content, whereas in real life this tag may often be omitted.
Multilingual project support. We failed to make it check terminology against the correct glossary. After checking the Arabic test file, Wordfast continued to apply Arabic glossary to the rest of the languages despite of numerous setup changes, glossary recreation, deletion etc. Even when there was no Arabic glossary existing on the computer, Wordfast still reported Arabic terminology errors. It might have got much better scores if it used correct glossaries.
Right-to-left language support. It doesn’t properly handle right-to-left languages and just like all other tools reports terminology errors in untranslated segments.
Conclusion. As many other plug-in tools, WordFast provides quite a good solution for those who select it as a TM tool and do not want to implement a standalone QA tool.

Saturday, April 11, 2009

QA functionality in SDL Trados QA Checker 2.0

SDL Trados QA Checker 2.0, plug-in to SDL Trados 2007

Supported checks. Basic set of checks performed by SDL Trados QA Checker is rather extensive compared to other plug-in tools and may be extended using regular expressions. This tool does not allow to specify Chinese full stops as valid punctuation marks. Moreover, Arabic and even Easter European characters cannot be included into forbidden characters list. It also does not check quotation marks and number formatting.
False positives. Unlike all other tools, it generates false positives by counting skipped and empty segments as incomplete ones.
Conclusion. In general, the tool is good enough for translators who work in Trados TagEditor, but may be hard to employ in dedicated quality assurance departments where batch processing of mono- and multilingual projects is normally required.

Friday, April 10, 2009

QA functionality in Star Transit XV Professional

Star Transit XV Professional, version 3.1 SP 21 Build 617

Supported checks. Star Transit employs the most limited number of checks without any further customisation. Available customisation is provided via fixed value lists and does not allow to add e.g. custom delimiters which in our case was necessary for Farsi .
False positives. Due to the limited number of checks supported by Star Transit it generates one of the lowest numbers of false positives.
File import. Import of TTX files is not correct enough; tags are represented in an unusual manner which hinders work with files.
Right-to-left language support. This tool proved to be surprisingly good at checking terminology in right-to-left languages. It also showed probably the best handling of right-to-left languages in general.
Reportability. The tool does not provide any reports; all errors need to be corrected “on the fly”.
Conclusion. In general, the real functionality of the tool is closest to the claimed one; however, it is too limited.

Thursday, April 9, 2009

QA functionality in SDLX 2007 QA Check

SDLX 2007 QA Check, build 7014
Supported checks. Basic set of checks performed is also rather limited. A user can extend and customise it using regular expressions; however, regular expressions are often beyond the qualification of a QA manager.
This tool does not check number values and does not check number formatting, double punctuation marks and brackets unless you set up a regular expression. It also does not check tags, and though it is hard enough to change tags in SDLX, TTX files converted to SDLX format may contain corrupt tags which won’t be detected.
Skipped translations are not converted from TTX files and therefore are also not found.
SDLX does not allow specifying Chinese full stops as a valid punctuation mark.
False positives. QA Check generates false positives for forgotten translations (counted as partial translations as well). Also many false positives are generated for partial/incomplete translations (they are not differentiated in SDLX) because incompleteness is determined only by translation length, not taking into account the number of sequential source words found in target segments.
Multilingual project support. As many other tools, QA Check Checks translation consistency between different languages.
Right-to-left language support. QA Check displays such texts left-to-right which hinders work with files and leads to reporting non-existing terminology errors.
Conclusion. With additional customisation, this tool is quite a good solution for SDLX users that do not want to involve additional standalone tools into their work processes.

Wednesday, April 8, 2009

QA functionality in Déjà Vu X Workgroup version 7.5.302

It's been almost two years since I researched translation quality assurance tools available on the market. The full research was presented at Translating and the Computer 29 conference in November 2008 and is published at my company Web site. Needless to say, I would like to update the results and find something really new in this area.

So far, just to start with some facts, I'll re-publish short benchmark results from the research. And I'll start alphabetically, so Deja Vu is the first.

Déjà Vu
Supported checks. The number of default checks is rather limited; however, the application supports custom SQL queries which most probably allows for extending the amount of possible checks and further customisation.
Multilingual project support. While checking several files translated into different languages, Déjà Vu applies the TM and glossary for the first language to all files no matter what their target language is.
Right-to-left language support. For Arabic and Farsi it reported terminology errors even where the translated term could be easily found using Find feature.
Reportability. The style of error indication probably fits translators who want to check the translation “on the fly”, but is rather inconvenient for a dedicated quality assurance department.
Conclusion. In general, this is one of few tools declared capabilities of which are close to real ones. The tool is only suitable for checking its native files. Although it supports other most common formats including Trados, SDLX and Star Transit, conversion is quite time-consuming and is not in general worth it.

Monday, April 6, 2009

SDLX sickness

This weekend I thought about what will Trados and SDLX together result into. And I recalled my very first tests that I did in 2002 or 2003. I had to make a comparison and select a TM solution for our new company and our main client. I'll try to find this comparison chart in my archives and publish or quote it here. I do remember there were Trados, SDLX, Deja Vu, Wordfast and some more TM systems.

Based on it, what I suggested to our client was SDLX. Why? Because it was nice, made use of TMs and glossaries in quite a convenient way, supported a lot of file formats and languages and had a free "lite" version for freelance translators. But what was maybe the most important thing was that the company was very responsive. They replied messages immediately, fixed bugs within 24 hours and always were ready to help, even to import our files into a new project if we experienced some troubles doing it locally.

I still remember the names of people who responded my mails. It was so nice to feel they care and you may rely on them. There were a lot of different problems with this software, but if we ever missed a deadline, it was not because of them. They all got resolved promptly.

SDL was growing, SDLX was developing, new features were added and old features were considered to be "obsolete". Free lite version was discontinued. Technical support first became slow, then paid. I don't know how prompt is currently their paid support. Unpaid one is pretty slow and useless, at least if you don't know addresses of real people you need to contact.

Knowing some SDL people almost in person, I'm quite sure they are pretty dedicated and doing their best, just like their predecessors were back in 2002 or 2003. What's changed is the environment they're in. I don't know if it's true for all large corporations, or if SDL just was unlucky on this way, or it it was their intention and they're really lucky to achieve their goal. But I really miss that nice cozy company that I selected 6 or 7 years ago.

Thursday, April 2, 2009

A few notes on SDL Trados QA Checker 2.0

A few days ago I had to perform formal QA on a project translated from English into German. I used SDL Trados QA Checker 2.0 (SP3) for this tasks and have a few interesting findings.

First of all, I received a lot of capitalization error reports. Apparently all nouns are capitalized in German which is not true for other languages, and this result into differences in capitalization. Quite annoying though!

Then, it reports repeated words like "OCR-A OCR-B" or "Task 1, Task 2", which is also annoying.

Third, it reported untranslated segments, and the segments really were untranslated. The problem was that those segments were just untranslatable, and TagEditor didn't even open it, but QA Checker still reported it. It's a good added safety feature, but annoying again. One example of such segment:
©2009

And last but not least, report on inconsistent translation is hard to work with as it doesn't reference the other translation compared to which this one is inconsistent. You have to use search feature and keep in mind what translation was there and what is here. Needless to say, annoying.

I would like to apologize to Patrik Mazanek who developed this plug-in. It in fact is a very useful and very capable tool, it can detect a lot of errors and is quite flexible, but as long as it works good, I apparently don't notice how good it was at detecting this or that error. And as soon as it comes to annoyance, I certainly notice it. Anyway, this post is apprently not to state how bad QA Checker 2.0 is. It's really great. But the purpose of this post is to make people (including Patrik and other developers who will be able to solve the problems) aware of the existing drawbacks.

Translation tools