Telatus Linguae – A case study on localizing the Wizarding World

Copyright Disclaimer: under section 107 of the Copyright Act of 1976, allowance is made for “fair use” for purposes such as criticism, comment, news reporting, teaching, scholarship, education, and research. This project is a proof-of-concept, and as such does not represent nor infringe on the creator(s) in any way.

Introduction

The Wizarding World (previously known as J. K. Rowling’s Wizarding World) is a fantasy media franchise and shared fictional universe centered on the Harry Potter novel series by J. K. Rowling. The series of books have been translated into 80 languages and half a billion copies were sold worldwide in 2018. The total box office of the movies took in 7.7 billion dollars across the globe.

Despite this global appeal, the official website of The Wizarding World remains surprisingly monolingual! Localizing the website into more languages allows global fans from different linguistic backgrounds to engage with the content in their native languages. This can significantly increase the website’s user base and overall reach. In this case study, we delve into the intricacies of the website’s localization, exploring the potential benefits of multilingual support and questioning the reasons behind its current linguistic limitations.

Workflow Overview

Timeline

Set up a preliminary timeline for this project.

Website Analysis

Wizarding World serves as a central hub for fans of the Harry Potter franchise, providing a digital platform to explore and engage with the magical universe in various ways. The website offers a range of content, including exclusive writings by J.K. Rowling, insights into the characters and settings, quizzes, and interactive experiences designed to immerse visitors in the magical world.

Visitors to the Wizarding World website can access information about the four Hogwarts houses, discover their Patronus, explore the history of magic, and dive into additional details that enhance the overall Harry Potter experience. Additionally, the website has features that allow users to engage with the community, participate in discussions, and connect with fellow fans from around the world.

According to the traffic analysis by Similarweb, the top three non-English regions are Brazil, Japan and France, making up more than ten percent of the total traffic and there are a lot of room for growth. By adding at least Brazilian Portuguese, Japanese and French, the website can provide a better fan experience by tailoring content and design to match local expectations.

By the time I studied this case, I noticed that one single language switcher (Japanese) was added to a popular subpage – The Sorting Ceremony. This marked its first baby step of localizing the website into multiple languages. Fun fact, the United Kingdom is consistently ranked among the top ten countries in “Countries to Visit” rankings conducted by travel agencies and other organizations for Japanese people.

Any localized content would resonate better with fans, leading to increased engagement and higher conversion rates. When fans can easily understand and relate to the content, they are more likely to take desired actions, such as making a purchase on the franchise items. Multilingual version of The Wizarding World is essential for effective global marketing as well. It enables the client to customize their marketing messages to different regions, considering cultural nuances, preferences, and market trends.

Clone the Website

Compared to recreating the website from scratch on WordPress, using a website cloning tool like HTTrack is much more easier than borrowing contents nib by nib from The Wizarding World.

After downloading and installing the software, we can simply copy and paste the URL and HTTrack will begin to crawl and copy the website automatically, which will then be hosted on my local machine. This process may take a while, depending on the size of the file. The Wizarding World is a complicate websites with a lot of subpages. So it’s best to choose one simple subpage for the purpose of displaying.

Below screenshot is the cloned website subpage. You can see there are a few pictures missing, but the structure, design and layout, especially color scheme and fonts, are identical to the original site.

Proxy Translation & Demos

We propose to use a solution called Proxy Translation to calculate the word count to estimate translation cost and also create demos of respective languages to showcase the localization output in the process.

Weglot is a platform designed to make it easier for website owners and developers to translate and manage the content of their websites into multiple languages. It provides a simple and efficient way to create a multilingual website without the need for extensive coding or manual translation efforts.

Weglot typically works by integrating into the website through a plugin or adding a code snippet. Once integrated, it automatically detects and translates the website content into the languages of client’s choice. It also offers a user-friendly interface for managing translations, allowing users to edit or customize translations as needed.

Compared to what we used to do for website localization (WPML, Crowdin, etc.), proxy solution is much more intuitive. It takes less time and even the strings that are more complicated to localize can be easily dealt with. This is particularly handy for managers without technical background and suitable for websites that won’t change too much over time. The Wizarding World is one of those websites.

Work Consulted

https://en.wikipedia.org/wiki/Harry_Potter

https://www.msn.com/en-us/travel/news/a-magical-trick-that-attracts-even-modern-people-why-do-japanese-people-love-harry-potter-so-much/ar-AA1bpwAj

https://www.elegantthemes.com/blog/wordpress/weglot-plugin-wordpress

https://verpex.com/blog/website-tips/how-to-make-a-duplicate-website

Odd but Understandable? Sorry, I don’t buy it.

A Dog Have I.

What do you think of this sentence? Bizarre? Incorrect grammar? Odd but understandable?

This is one of the examples from Post Editing Guidelines For BOLT Machine Translation Evaluation rule 3.5, indicating that this kind of sentence structure is “readily understandable“, and the rule suggests “DON’T rearrange the phrases just to improve the fluency“.

From my personal perspective, translation like A dog have I” is NOT acceptable, at least not in English (Maybe in German it’s fine, “Ich habe einen Hund” = “Einen hund habe ich“). The question I’m having is: Where do we draw the line? Theoretically, a “dog” can be a subject in the sentence. Why can’t “I” be owned by a dog?

The other examples in the rule meet the requirement of “odd but understandable”, but what if the whole translation is full of those kind of sentences with chaotic word order? How would that impact the readability? 

Maybe the reason why I’m having doubt is that I’m not a native English speaker. Interestingly, in Chinese, there was a similar popular test on the sequence of character order that I totally understand: 

研表究明,汉字序顺并不定一影阅响读。(The study shows that the sequence of Chinese characters does not affect readability.)

The sentence above is not written with correct character order but Chinese people won’t have any trouble reading it. What happened here is that our brains auto-corrected the mistakes in the sequence. When we read Chinese, we tend to segment the sentences into several chunks. Each chunk consists of several characters. We absord information chunk by chunk and we pay less attention to individual characters.

Back to English. I have asked ChatGPT to generate a paragraph specifically with sentences that have wrong word order:

To be honest, this looks like a disaster to me. But I’m curious about what you (especially native English speakers) might think. Feel free to leave comments!

Project Showcase: LAWful Translation

LAWful Translation is named after the initials of the three founders: Lide, Amory and Winston (yep, that would be me). We came up with the name to finish our semester-long project in the Translation Technology class. This project showcase includes proposal, deliverables and our key takeaways on the usage of CAT tools and translation workflows.

Introduction

The scenario is that we (LAWful Translation, translation vendor) are delegated with a translation job by our client (China Energy Conservation and Environmental Protection Group, CECEP for short). The specific objectives of this localization project, forming part of the broader commitment under previous overarching contract, is for us to deliver the localized content of the 2023 CSR report from Chinese to English.

Tasks and Deliverables

TasksDeliverables
Creation/Update of the term base and translation memory regarding this projectTerm base in .xml format; translation memory in .sdltm format
Creation/Update of the style guideStyle Guide in .pdf format
Pseudo-translation of the same documentPseudo-translation in .docx and .pdf format
Chinese to English Translation of the Social Responsibility ReportTranslated document in English in .docx and .pdf format
One round of Proofreading and Editing(same above)

Process & Timeline

Lessons Learned

Considering the client has provided the translated version of the report from 2022, we took that into consideration and worked on the quote with reasonable discount based on fuzzy matches. By creating the translation memory with the alignment of that version, we are able to finish the translation in a much shorter time.

One thing during our kick-off meeting with the client is that he asked about involving a reviewer from his side in our workflow. We weren’t sure about where exactly we should implement that step but later after some discussion, we have decided to involve the reviewer as early as possible. 

The reason is that the reviewer should be informed about the style guide prior to the translation. If their participation is too late, we might face a lot of back and forth during the translation or even after the translation is done. 

However, one thing I would like to point out is that whenever we involve an external personnel, especially from the client side, we must treat with extra caution. It takes time to familiarize the cooperation and one way to keep things professional is to create a “review form” visible to both sides, logging all the feedback and addressing them with care.  

I believe when we perform in a professional way, we can work out any disagreements.

Documents