Subscribe to our E-Newsletter

Submit

Smith's Blog



Posted Jul 26, 2010 at 11:52 AM by Smith Yewell

For information on why we joined the Open TM2 initiative, please follow this link: http://www.lisa.org/globalizationinsider/2010/07/open_tm2.html

I have also included the interview below.

Smith


What is it about the Open TM2 initiative that motivated Welocalize to get involved?


Most translators use some type of translation workbench tool. Most clients use some type of content management tool, and most vendors use some type of translation management tool. To make it even more interesting, add machine translation tools, authoring tools and a variety of content types. Now combine all of those users and their various tools and try to pass the content type you want translated between each of them, and tell everyone they have half the budget, time and staff to do it!

Yes, I have exaggerated a bit to make a point, but the basic elements of this challenge are what I am hearing from clients, vendors and translators. Traditional methods across our translation supply chain are just not up to the task of the now always-on velocity of end user demands.

In order to increase velocity across the translation supply chain, we need to increase automation which implies more integration, interoperability, extensibility – and standards. We are by no means the first industry to confront this challenge, so why not borrow what has worked elsewhere. At the heart of every sophisticated and mature supply chain is a consistently followed set of standards. As Craig Barrett, former Chairman of Intel, stated, "The world is getting smaller on a daily basis. Hardware, software and content move independent of, and irrespective of, international boundaries. As that increasingly happens, the need to have commonality and interoperability grows. You need standards so that the movie made in China or India plays in the equipment delivered in the United States, or the Web site supporting Intel in the United States plays on the computer in China."


What sort of progress do you think has been made in the area of standards, and what work remains?

Unicode has probably been the most successful standard related to our industry. Unicode specifies a standard for the representation of text in just about any language across software products and systems. Before Unicode, there were hundreds of different encoding systems, and they often conflicted with each other. The significant problem was potential corruption in the passing of text representation data between different encodings or platforms. Thus the Unicode Consortium was formed, and to its credit, Unicode now “enables a single software product or a single website to be targeted across multiple platforms, languages and countries without re-engineering. It allows data to be transported through many different systems without corruption.”

Other standards, such as TMX, have not been as successful. We need to understand why this has been the case? As Bill Sullivan, IBM Globalization Executive, stated, “There is a recognized and growing need for standards in the localization industry. Despite our best intentions, however, standards themselves can often be vague and open to multiple interpretations. What is needed are reference implementations and reference platforms that serve as concrete and unambiguous models in support of the standard.”

This is the work that remains. We need to demonstrate more tangible benefits for adhering to a standard in typical use case scenarios and integrations. How can a client easily integrate the translation assets of an acquisition? How can a client plug-and-play what they deem as the best tool components? How can a client change tools? These are the simple questions I hear. To get closer to the answers, the Open TM2 Steering Committee is working on a Joomla (content management), Open TM2 (translator’s workbench) and GlobalSight (translation management system) integration. The goal is to develop a viable data exchange standard which works seamlessly in this 3-way environment and then extend it to other integrations in the translation supply chain.

LISA will document and publicize the resultant standards. However, neither the Open TM2 initiative nor LISA alone can make the greater vision a reality. As the Unicode initiative demonstrated, broad participation and support across the industry is necessary to achieve success. The Unicode Consortium includes corporate, institutional, individual, NGO and public sector members all collaborating with a unified purpose.

What is different about the Open TM2 initiative?

Open source and “free” are often found in the same sentence. Yes, there is no charge to download an open source product such as Open TM2 or GlobalSight, but there is a cost associated with support, training and customization to specific needs. Open source is not a “free lunch”, but it is an opportunity to engage, integrate and customize at a much deeper level and at a faster pace. The result is potentially a product that is more suited to one’s needs, more easily integrated with other products and a lower total cost of ownership. But what you get out of it is subject to what you put into it. As an ancient Chinese proverb reads, “Talk does not cook rice.” We need people willing to take action. These concepts apply to all open source projects.

What I think is different, and exciting, in this Open TM2 initiative is an increasing alignment of broader interests. Industries typically do not change significantly until the market forces them to change (look at the American auto industry). I think there are some market mega trends in play right now (cloud computing, mobile computing, social computing, open source) and those who don’t adapt to these trends will quickly be left behind. The “translation project” as we knew it traditionally is rapidly morphing into on-demand translation. SimShip is rapidly morphing into SimStream (simultaneous streaming releases). Translation tools and platforms are rapidly morphing into “mash-ups” (combinations of different tools with the sum benefits being significantly greater than the individual benefits). The translation service on the whole is rapidly morphing into a utility inside a broader and more deeply integrated global content supply chain. RFPs now have pages and pages of interoperability, integration and optimization questions. And according to Gartner, "The number of open-source projects doubles every 14 months. By 2012, 90% of companies with IT services will use open-source products of some type.”

So, I think the timing is right. Many, certainly not all, clients, LSPs, tool providers and translators alike are realizing that it is in the best interest of the supply chain as a whole to collaborate to achieve something on the scale of what was achieved with the Unicode standard. “Do not go where the path may lead, go instead where there is no path and leave a trail.” – Ralph Waldo Emerson ...

Posted Jul 9, 2010 at 9:49 AM by Smith Yewell

I was recently inteviewed about the Open TM2 initiative by John Yunker from Global By Design.  The interview can be found here.  I have also copied it below.

Translation memory goes open source: An interview with Smith Yewell of Welocalize

Posted on July 8th, 2010 by John Yunker

Translation memory helps companies re-use previously translated text, improving consistency and potentially saving money.

But translation memory requires using translation memory software, which has for years largely meant using SDL Trados software.

When a company hires a translation agency and requires that they use translation memory — not only must that agency have Trados software, but so too must the freelance translators — who are often located all around the world. This is a nice business model for SDL, but it has been a pain point for translators and agencies for years.

For agencies, the more acute pain point has been that SDL not only sells TM software but also sells translation services. Nearly every translation exec I have spoken to has openly asked for an open-source alternative to Trados.

Well, now we have one.

IBM has partnered with LISA (Localization Industry Standards Association), Welocalize, Cisco, and Linux Solution Group e.V. (LiSoG) to launch an open source project that provides a “full-featured, enterprise-level translation workbench environment for professional translators.”

It’s called Open TM2 — and it’s basically a scaled-down version of what IBM has developed and used internally for years. I haven’t used the product yet and there’s understandably quite a bit of work involved to get this software to a point where it’s easy for translators, agencies, etc. to consume.

I’m not prepared to say Open TM2 is going to put an end to Trados. After all, Linux didn’t exactly put Windows or OSX out of business. But I am excited to see it out there in the world. Open source keeps software vendors on their toes. I’ll be very curious to see if developers embrace the code, and what they come up with.

To learn more, I interviewed one of the partners behind Open TM2, Smith Yewell, CEO of Welocalize.

Here is what he had to say:

Q: Why did IBM decide to open source its software in this fashion? What does it hope to gain?
Bill Sullivan can answer this question better than I, but as he stated, “Freelance translators are the backbone of the localization industry. These translators have longed for free and open translation tools to increase their productivity. There is a recognized and growing need for standards in the localization industry. Despite our best intentions, however, standards themselves can often be vague and open to multiple interpretations. What is needed are reference implementations and reference platforms that serve as concrete and unambiguous models in support of the standard.”

In my opinion, productivity and standardization go hand-in-hand. By releasing Open TM2 as an open source product with a standards-based, data-exchange goal, not only is there potential for increased productivity – flexibility and freedom of choice also increase.

Q: And what do you hope to gain from this effort?
I like to use the mobile phone analogy. I can travel just about anywhere in the world, turn my phone on, and it works. This is possible, because competing carriers and hardware manufacturers collaborated to be able to offer that seamless user experience across global networks and handset protocols. Consider the user experience in our industry. There is really no ability for a client to turn on a translation supply chain and have it work out of the box across various content types, tools and translation vendors. The clients I speak with are demanding that this change.

GlobalSight, Joomla and Open TM2 are being used to demonstrate an example of a seamless data exchange based upon a set of standards. LISA will play an important role in documenting and sharing these standards so that they can be applied uniformly to other integrations. To put it simply, we need a variety of tools to be able to talk to each other in an automated way. This is where I think we can improve time, cost and quality results and greatly improve the user experience. Ultimately, I expect Welocalize to gain an increase in productivity, interoperability and freedom of choice in configuring the best set of tools for each client’s unique translation supply chain needs. If we can get under the hood, we can tune the engine; otherwise, it is becoming increasingly difficult to gain time, cost and quality advantages from the old way of doing business.

Q: Who is going to use this software? And what software will it replace?
Many translators are already using TM2 in delivering work to IBM. I expect Open TM2, as its features grow, will appeal to more translators as a desktop workbench. This is only an initial release of the open source product, and there is much work to be done. But the potential is there to collaborate and improve. Ultimately, I think Open TM2 has the potential to replace the Trados desktop workbench.

Q: When you talk open source, stability and support are common pain points. Who will be actively supporting this effort?
The members of the Steering Committee are currently supporting the effort, and the goal is to build a community which can support itself. This open source initiative is not unlike others, what one puts into it will determine the benefits one can pull from it. I wouldn’t be surprised to see a company create a business model to offer Open TM2 support. Support, training and customization are typical services that bloom around open source initiatives.

Q: What would stop a technology company from taking the source code and creating a competitive TM product?
It is an open source product, so there is potential for companies to build a business model around the product. However, I doubt that will be a proprietary fork of the code. The appeal is an open source product with growing standards compliance, not yet another proprietary product. What is more likely are support, training and integration services. Anyone investing in the product naturally expects a return, and the better the return, the more healthy and diverse will be the community. I think that is a good thing. Competition drives innovation. However, if we can’t get the standard data-exchange protocols right, productivity across the supply chain will continue to lag the increasing velocity of change in the marketplace. Rapidly evolving time, cost and quality demands already exceed what the traditional translation supply chain can deliver.

Q: The source code is available now but documentation is lacking. What is your timetable for launching a more translator and agency friendly product.
I think the first step for the Steering Committee is to take the feedback that is already coming in about the product, good and bad, and use that to set priorities, responsibilities and a timeline. The idea is sound, but it must be tested in practical use and refined according to what the market really needs. Translators have the answers to many challenges in our supply chain, they are just not asked very often.

Q: How will this software be integrated? Is there is a goal of integrating it with the open source GlobalSight CMS?
Content creation, translation, workflow and performance metrics reporting – there are many systems and tools for accomplishing each of these requirements. However, very few of them can pass necessary data in an automated way. A lot can be accomplished with web services and open APIs, but widespread integration possibilities can only be realized with a critical mass actively using an industry-supported data-exchange standard.

In order to demonstrate this possibility in a live use case scenario, Joomla, GlobalSight and Open TM2 will be integrated with the resultant standards published by LISA. I think additional standards organizations will also need to participate to gain wider understanding, agreement and adoption. If enough of the industry’s thought leaders and leading practitioners get behind this standard data-exchange and tools integration challenge, I think all boats will rise. Without it, the industry will never be able to approach the growing volume of content which current production and cost models can’t support. ...

Copyright 2009 Welocalize