June 2008


China and Japan and Korea and Language and Thoughts30 Jun 2008 03:15 am

As historians, we often engage in the liberal use of quotations to sanitize and quarantine distasteful terms or phrases that lend legitimacy to a category or a way of referring to an institution or other body. The use of these quotes, which I confess to frequently using, presumably robs such terms of their nomenclatural power and further serves to establish distance between us and the ideas and terms we enlist to talk about the past.

Finally, use of these quotation marks excuses us from having to spend time analyzing the terms themselves, putting them aside as if to say, “Yes, yes, this is a very inappropriate term that needs careful and sensitive discussion, but since I’ve a lot to do in this essay, I just can’t be bothered at the moment to deal with it.”

Some people seem to feel that the aesthetic impact on one’s work is such that the frequent use of quotations is just not worth it, or perhaps feel that we simply aren’t accomplishing anything useful by using them for direct translations or referrals to terms as they were used decades or centuries ago. However, not using quotations or confronting problematic terms can earn the ire of book reviewers, as I discussed in a response to a review of the book Collaboration by Timothy Brooks. Brooks was criticized for used the term “pacification teams” to refer to the units the Japanese called “pacification teams” in occupied China during the war even if he is anything but sympathetic to the Japanese in his book.

One strategy is to use quotations once, and then announce that you won’t be using them anymore. I came across this tactic today when reading a Chinese translation of an essay by Matsuda Toshihiko, called 日本帝國在殖民地的憲兵警察制度:從朝鮮,關東州致滿洲國的統治樣式遷移 (English title was listed as “The ‘Gendarme-oriented’ Police System in the Japanese Colonial Empire: The Transfer of Models of Rule Used in Colonial Korea to Kwantung Province and Manchukuo”) After putting Japan’s 內地 (the interior of Japan = Japan proper excluding its colonies) and terms like 滿洲 (Manchuria, 滿洲國 Manchukuo, the largely Japanese controlled Manchurian state from 1932-1945, often called 僞滿州 or the “puppet Manchukuo”) in quotations, he follows each with “一下省略括號” (“Brackets left out below”).

Another strategy that can sometimes be used, which is one I follow for some words like “traitors,” is to embrace a word and use it quite shamelessly in order to deliberately provoke the reader. In English, the word traitor has lost much of its punch of late – a good thing in my opinion – but still holds great power in many other places and languages. The discomfort generated by the word and the way it forces readers to think about what it really means is part of what I aim to achieve when I use the term. Far from wanting to contribute to the term’s legitimacy, my deliberate use of it is partly out of a kind of mockery, but more importantly out of a desire to help set the scene of the politically charged context in which it was used.

Though I can’t speak for them, I suspect something similar is being done in some other famous cases of this. Some scholars of Korean history have been strongly criticized for using words like “terrorist” to describe Korea’s national tragic hero Kim Koo. I suspect these same critics would have much less opposition to him be referred to by his popular nickname, “the assassin.” I really don’t have strong feelings on this issue and I don’t think it is as straightforward as my own case, but it raises some interesting questions. What if these scholars are also engaging in a dual process of linguistic mockery and deliberate attempt at reviving a historical scene? Should the word be off limits entirely, should it necessarily be accompanied with quotations, or are there alternatives? What I think escapes some critics of such scholars is that I believe at least some of them are using the word terrorist not as a way to conjure images of Kim Koo as a suicide bomber in a crowded market but, on the contrary, to show how the word terrorist has itself a history and potentially embraces a wide range of figures we might be less willing to unconditionally condemn. In doing so, they potentially open a space in which to critique the way the word has come to be used and what it now narrowly represents, as well as the wide range of activities and contexts it covered both in the past and now. Can we only engage in such a rhetorical technique through the use of quotations?

I’d be interested in hearing from other students and scholars about this. What strategies do others take when they are faced with the need or potential need to establish quotational quarantines? What conventions do you follow?

Workshop29 Jun 2008 06:09 am

In addition to my Muninn blog, I occasionally make contributions at the various Frog in a Well blogs, the East Asia Libraries and Archives wiki, some programming weblogs at Fool’s Workshop, and create various scripts and other creations. I have a Friendfeed profile but I don’t really like leaving this sort of thing to third party sites that come and go with the fads. I’ve decided to keep track of most of my own online projects with a little site and feed called the Workshop Wire.

Workshop Wire (The RSS Feed for it is here)

China and Language and Tech and Workshop22 Jun 2008 02:24 pm

The Problem: Let us say you have a list of Chinese words or single Chinese characters in a file. There are a lot of them. You want some easy and fast way of getting the pinyin and English definitions of that list of words or single characters and you want this in a format that can be easily imported into a flashcard program so you can practice these words.

Today I faced this kind of problem. There are lots of “annotator” websites online that make use of the free CEDICT Chinese dictionary but I have yet to find one which outputs a simple, and nicely formated (with all [...], and /…/ stuff removed) tab delimited vocab lists.

I have recently been frustrated by the fact that I often come across Chinese characters that I haven’t learn, or, more often, characters that I only know how to pronounce in Japanese or Korean. I also am frustrated at the fact that I have forgotten the tones for a lot of characters I knew well many years ago when I studied Chinese formally.

Over the summer I want to review or learn the 3500 most frequently used Chinese characters, particularly their pronunciation, so that I can improve my tones and more quickly lookup compounds I don’t know. (( The top 3000 make up some 98-99% when their cumulative frequency is considered. ))

I found a few frequency lists online (see here and here for example) and I stripped out the data I didn’t need to create a list with nothing but one character on each line. (( A few of the frequency lists I have seen have Cedict dictionary data included but not in a very clean format )) Although it is an older list based on a huge set of Usenet postings from ‘93-’94 you can download an already converted list of 3500 characters here. (( I notice that there is a high frequency of phonetic hanzi for expression emotion in the postings and some other characters one doesn’t come across as often in more formal texts, I actually don’t mind ))

Since I’m not in the mood to look up 3500 characters one by one, I spent a few hours this evening using this problem as an excuse to write my second script in the Ruby programming language.

In the remote possibility that others find it useful who are using Mac OS X, you can download the result of my tinkering here:

Cedict Vocabulary List Generator 1.1

This download includes the 2007.8 version of CEDICT, the latest I could find here. (( If you find a newer version (in UTF-8) put it in the same directory as my script and name it cedict.u8 ))

How this script works:

1. After unzipping the download, boot up the “Convert.app” applescript application. It will ask you to identify the file you want to annotate. It is looking for a text file (not a word or rich text file) in Unicode (UTF-8) format with either simplified or traditional Chinese characters or word compounds, one on each line.

2. This application will then send this information to the convert.rb ruby script which will search for the words in the CEDICT dictionary in the same folder, format the information it finds (the hanzi, pinyin, and English definition), including the putting of multiple hits for the same character/word within the same entry with the definitions numbered. It does not currently add the alternate form of the hanzi (it won’t add simplified version to traditional or vice versa).

3. It will then produce a new file with the word “converted” added to its name. It will create tab-delimited files by default but you can change this by changing this option at the top of the convert.rb file in a text editor.

4. Though this version of the script doesn’t do this yet, you may want to run the resulting text through the Pinyin Tone dashboard widget or a similar online tool such as the one here or here. That will get rid of the syllable final tone numbers and add the appropriate tone marks. I am having a bit of trouble converting the JavaScript that my widget and this site uses into Ruby so if anyone is interested in working on this let me know!

If the script doesn’t work: make sure you are saving your text file as UTF-8 before you convert. I am also having trouble when my script is placed somewhere on a hard disk where the path has lots of spaces. Try putting the script folder on your Desktop.

Note: If you don’t have Mac OS X but can run Ruby scripts on your operating system, you may be able to run my script convert.rb from the command line. It takes this format:

convert.rb /path/to/file.txt /path/to/cedict.u8

UPDATE 1.1: The script now replaces “u:” with “ü” (CEDICT uses u:).

General12 Jun 2008 10:36 am

Wenlin is the the best piece of software around for students of Chinese. Among other tools, it has a powerful and handy offline dictionary with very flexible and fast search options as well.

I know many students of Chinese that use Wenlin to get their definitions and input vocabulary into flashcard software. Most recently I saw someone do this in a coffee shop here in Taipei, and it brought back a lot of memories of me doing the same in Beijing almost a decade ago.

Wenlin doesn’t make it easy for you, however, to get the word entries into a format that can be easily imported into flaschard applications. There is no “export” feature, presumably because the developer doesn’t like the idea of large parts of the Wenlin dictionary getting out of the software and into a separate database. However, the lack of such a feature means that students have to copy and paste words from Wenlin and add their own tabs. In my case, I also like to delete the alternate hanzi to keep my flashcards more clean.

Although a more experience programmer with good regular expressions skills could easily take this further, I am releasing the results of an evening spent trying to learn how to program in the programming language Ruby:

Wenlin Conversion Script 1.3

Here is a screencast explaining how to use the script:

Wenlin Conversion Script Screencast

This script takes a text file with a list of Wenlin dictionary entries (Saved in TextEdit, not in Wenlin) and puts tabs between the hanzi and the pinyin and between the pinyin and the definition. It saves the converted file which can then be easily imported into your favorite flashcard program.

It is made up of two scripts: the convert.app applescript application which you is what you use to run the script and the convert.rb ruby script which does the actual conversion. You can customize three options in the convert.rb script. Just open it up and set the three option variables at the top to true or false according to your preference for that option. There is a description of what each option does in the ruby file but basically they control whether the alternate traditional/simplified hanzi are removed, whether the “|” character is changed to “Example: ” and the “~” in examples replaced by the pinyin of the word.

I haven’t tested this too extensively so if you see it do strange things with the wenlin vocab items let me know and I’ll tweak the script in the future.

UPDATES:

-I just noticed in the screencast that it split the word “fandong fenzi” and put “fenzi” into the definition – I need to update the regular expression so that it looks for the part of speech rather than a space to separate the pinyin from the definition. I didn’t realize that Wenlin sometimes puts spaces into its pinyin words. I’ll release this soon.

-I just updated a 1.1 version. See the enclosed Read Me file for things I have fixed and changed in this new version of the script.

-I just updated the script again to 1.3, see the readme in the download for the details.

Taiwan07 Jun 2008 09:51 am

P1000921.JPG

I’m a huge fan of Korean food and I will sorely miss it. I’m generally not as fond of the many oily and fried foods commonly found throughout China and Taiwan. I’m a big fan, however, of the many vegetarian buffets I have been to in Taipei. For the price of $2-4 you get a plate full of vegetables, tofu, rice, and vegetable dishes, eggs, and soup. These light meals work great to balance a heavier meal.

Language and Taiwan07 Jun 2008 09:40 am

P1000906.JPG

“Let Obama teach you charming English”

Above is an advertisement at some kind of English academy near Chinese Culture University in Taipei. Somehow I don’t think this concept would have worked with our sitting president as a model.

Taiwan05 Jun 2008 06:43 am

I just remembered an anecdote I mentioned in a longer posting here at Muninn as I passed by one of the street side trash cans here in Taipei today where, even when it is far from full, one has to concentrate to get a coffee cup through its small hole. Re-blogging the memory in question from the old posting:

[Street-side trash cans] are labeled “For Pedestrians Only” and apparently this warning is enforced. When I was walking down the street one morning I witnessed an old man ride past a garbage can on his bicycle. Stopping clumsily, he squeezed a small bag of garbage into the garbage can. Just as I passed, witnessing this, a man leapt off of the bench to the right of me…and yelled at the man while flashing some kind of official ID. I couldn’t understand his crazed shouting but I imagine it was something like, “Freeze! Garbage Police!” I watched in fascination as the young garbage cop lectured the old man, who bowed his head low and listened quietly. I didn’t stick around to see if the old man was fined, but considering the fact that eating or drinking on the subway in Taipei can get you a $1,500 Taiwanese dollar fine, I wouldn’t be surprised if he was slapped with a hefty amount for his blatant misuse of the public garbage can.

Personal and Taiwan05 Jun 2008 06:37 am

My Fulbright in Korea has ended, as has a year of language study and dissertation research in Seoul. I moved to Taiwan on Monday and will be here in Taipei until the end of the summer. I’m quite fond of this island and look forward to shifting my research to Chinese language sources. I just moved into a new apartment today and am pretty much up and running in my new life here.