Only 50% Of Twitter Messages Are In English, Study Says
Robin Wauters
Feb 24, 2010

Paris-based Semiocast, which helps brands understand and interact with real-time Web services, has performed a semantic and quantitative study of Twitter based on an analysis of 2.8 million tweets.

Turns out roughly half the tweets posted on the micro-sharing service are in English, down 25% from last year, even though the company is based in the U.S. and has more users and momentum in English-speaking countries than anywhere else on the planet. The analysis further showed that the top 5 languages used on Twitter are English, Japanese, Portuguese, Malay and Spanish.

Semiocast says the study was conducted on messages gathered over a period of 48 hours, from February 8 to February 10, with the sole aim of determining which languages were most often used on Twitter. The messages were processed with the company’s own analysis tools, which it says can identify the language used in short messages for some 41 languages, including Greek, Hebrew, Chinese, Korean, Tamil, etc.

English is still the most used language on Twitter, with 50% of messages, although Semiocast says this is a far cry from the two-third share they registered for English in the first half of 2009. Semiocast also forecasts that its share will grow thinner in the future, as Twitter becomes more internationalized (i.e. becomes available in more languages) and its pervasiveness spreads to Asia and Latin-America.

Japanese comes in second with 14% of messages. This isn’t all too surprising; Twitter has been addressing that market for almost two years now. The third most used language is Portuguese with 9% of all messages, mirroring the success of social networks in Brazil.

The rapid adoption of Twitter in Malaysia and Indonesia, where Twitter has partnerships with two mobile telcos in place, shows in the rankings as well. Malay languages, including Bahasa Malaysia and Bahasa Indonesia, now represent the fourth most used language on Twitter, with 6% of messages. Spanish comes in fifth with 4% of all messages.

The ranks six to eight are occupied by major European languages, namely Italian, Dutch and German, each accounting for about 1% to 2% of total messages. French represents a little less than 1% of total messages.

For your reference: Twitter earlier this week claimed that it was seeing about 50 million tweets per day now, so an analysis of less than 3 million messages measured over two days may not be super representative. Nevertheless, it only takes a peek at Twitter’s public timeline to see that there are lots of people using the service in a language other than English.

Advertisement
  • Related Topics
Advertisement
  • http://phat1.com Slipdisc

    Interesting. I didn’t see Ebonics listed in the graph though.

  • http://www.OneHourTranslation.com Ron

    I hope everybody here already knows about Twitrans, the service that allows you to translate your tweets for free, using human translators only. This way you can be sure that your tweets are well translated (unlike machine translation). You can find it in Twitrans.com

  • http://sixservix.com/blog/david David Bonilla

    I’m a proudly member of this 4% of spanish twitts :) and I hope that it will increase very soon.

    Incredible this 9% of Brasil-powered-portuguese twitts :O

  • http://twittercism.com Sheamus

    It’s not quite the same, of course, as there’s less overall connection, but I wonder what the percentage of non-English updates on Facebook is? Or MySpace?

    Twitter’s aiming globally and one would therefore have to presume that ultimately tweets will meet with global language standards. English might be the ‘world’s language’, but only some 400m speak it as a first language, which ranks it below both Chinese and Spanish.

    As a second or third language it ranks higher, but one typically writes (and therefore tweets) natively. Hence, in five years or thereabouts Twitter – or whatever replaces it – will be predominately non-English, and by quite some measure.

  • http://twitdom.com/ Anuj Seth – Twitdom

    How about supplementing this with a graph that shows the country-wise breakdown of Twitter users?

  • http://www.technogati.com Surender Sharma

    It’s good if 50% twitter tweets in English because English is global language but in future it should be increase.
    Twitter also integrating local languages for many countries.

  • Robin Wauters

    Do you have one?

  • itgruendungmuenchen

    “English is global language”

    better learn chinese ;)

  • http://loqnerd.com loq

    wow! Malay language are in top 5! So amazing. Congrats buddies!

  • http://jacobian.web.id Jacobian

    well I am from indonesia and reading this news just make me feel proud to be indonesian.hehe…

    it’s just that many trending topics in twitter are started by indonesians, well as you know indonesia is a large country with many people in it. so you got the idea right? :-)

  • joão rosa

    Of course the big part of those 9% portuguese-speaking twits come from Brazil. But twitter is also ugely popular in Portugal, a success that started earlier then the one in Brazil. A smaller part come also from portuguese-speaking Africa, that follows what’s going on on portuguese and brazilian internet.

  • http://www.facebook.com/profile.php?id=1329780505 Visvesvaran Subramanian

    That was the first thing I thought when I saw this. I think it’d have a HUGE effect on the results…

  • http://www.facebook.com/profile.php?id=607095590 Adam Soutter

    /sarcastic voice

    “wow”

  • bay

    indonesia in top 5… amazing…

  • Brian D

    So now we know the main targets of the spam ?

  • http://www.fazreen.com Fazreen

    I’m proud to be Malaysian!

  • cikguWahida!

    Malaysia boleh! :D

  • http://martijnlinssen.com Martijn Linssen

    “not super representative”?

    http://blog.twitter.com/2010/02/measuring-tweets.html is indeed saying that there now are 50 million tweets a day: 48 hours would make 100 million tweets, not just 2.8 – dead give away what the “representative” percentage rate is there…

    That’s a mega-super understatement right there if you ask me…

  • http://roadtweets.com Jim

    I’m not at all surprised. It might have to do with their open API. For some reason our RoadTweets app is getting alot more traction in South Korea and Japan than the rest of the world (see here http://roadtweets.com/RoadTweeters ).

    Also, it appears many of the users are first time twitter users, so these apps are spreading the word about Twitter far and wide.

  • http://webecologyproject.org David Fisher

    It would be worth noting that the Web Ecology Project did this and had similar findings back in September 2009, plus we open sourced some code for doing it yourself! Oddly TC wasn’t interested in covering it then.

    http://www.webecologyproject.org/2009/09/code-release-google-language-tool/

    We analyzed 1M tweets at the time, but then the volume of tweets was slightly lower. We still could have probably ran our language detection on more, but the wait time was more with the Google language API- not with how many tweets we had.

  • http://www.facebook.com/profile.php?id=592822797 Scott Yates

    Hw cn u b shr ’bout dat? U no wht i meen??????

    My point in English: How can you be sure about that data? I mean, I can’t tell what half the tweets supposedly in English are saying. Just because they contain no words in English doesn’t say much.

  • igniman

    So , there IS non-english spam! what a revelation!

  • wyclif

    Surprising that Twitter hasn’t broken in the Philippines yet…Filipinos send more texts than any other country.

  • http://chimac.net/2010/02/24/half-of-twitter-messages-are-english/ Half of Twitter Messages are English « Chicago Mac/PC Support

    [...] of Twitter Messages are English It is really growing in other countries. I have noticed many other languages myself lately.  This is really exciting.  I think its such a [...]

  • ykrsdn

    Must be a real surprise to all Americans, “OMG, there are people speaking other languages?! What do you say, there’s a world outside the USA?!”

  • corne

    Sadly, with that 50% proportion.. 90% of it is a language spoken by bots. that leaves only a whopping 10% for real-genuine human language to be heard. :(

    As they said, Twitter has a saturation point. and that’s why FB is on a steady growth while Twitter is on a plateau. Twitter Details: http://bit.ly/twitter-1-billion-value-meltdown

  • http://aleksandar.grkov.com/blog/?p=11 “Only 50% Of Twitter Messages Are In English, Study Says” Tech Church | Aleksandar Grkov

    [...] Only 50% Of Twitter Messages Are In English, Study Says – Tech Church [...]

  • Terry McAssey

    Go Malaysia!

  • http://www.facebook.com/profile.php?id=25700318 Michael Evans

    Only 50%? that seems pretty high for the 3rd ranked language.

  • http://webecologyproject.org David Fisher

    Do you have anything to back up this claim that 90% of tweets are from bots? I didn’t see it in that link that you posted, or the method of how you collected that data.

    A member of our group wrote an (unreleased) bot detector and it was rather good- we didn’t pick up anywhere near 90% being bots on first glance. High- but not 90% bots.

  • http://webecologyproject.org David Fisher

    While that is occasionally a problem- it really isn’t generally for measurement.

    Google’s language detection (which is rather good) can figure things out rather well and return an accuracy index for how sure it is about the language. You can easily weight this into your stats.

    The harder part would be tweets that contain both English and non-English words.

  • Joao Pe de Mesa

    Esses 300 milhoes de Americanos retardados pensam que o resto dos 5 bilhoes de habitantes desse planeta que falam outra lingua nao usam computadores.

  • Javier

    Yes, but since most of the spam is in English, it’s very easy for us who speak other language, to recognize spam

  • http://www.thegeekplanet.com/2010/02/24/twitter-in-the-news/ the geek planet » Twitter in the news

    [...] According to an independent but small-scale study, only about 50% of all tweets are in English, down from [...]

  • http://www.facebook.com/profile.php?id=506991550 Ivan Lazarte

    The rest are in “American”.

  • Amanda

    It’s funny that these sites just don’t spring up in those countries. Orkut is a good example.. it’s been completely overrun by Portuguese speakers. It’s gotten bad enough that American users have been getting anti-American spam comments. Orkut is owned by Google, and American company.
    Same with hi5.com – they went viral – in the Phillipines. Facebook & Myspace are similar.

  • http://www.appbank.net/2010/02/25/iphone-news/96647.php [朝刊]Final Fantasy ことファイナルファンタジー 1,2 が出たぞ!

    [...] Only 50% Of Twitter Messages Are In English, Study Says 日本語がんばってるな。 [...]

  • http://www.facebook.com/profile.php?id=696264464 Dennis Wakabayashi

    Mojofiti converts tweets into several languages. You can see an example at my profile: http://twitter.com/dwakabayashi or sign up at http://www.mojolingual.com and have your tweets translated for free for the time being while we’re in beta.

    Sorry for the add here, but it seemed appropriate since some of that 50% is being done by us.

    HTH.

  • http://dailysocial.net/2010/02/25/twitter-siap-luncurkan-platform-iklan/ Twitter Siap Luncurkan Platform Iklan | DailySocial

    [...] mulai dengan” bisa nongol jadi Trending Topics. Baru-baru ini diungkapkan oleh Semiocast (via Techcrunch) bahwa hanya 50% tweet yang beredar menggunakan Bahasa Inggris. Apakah Twitter siap mengolah [...]

  • http://techcrunchies.com Anand Srinivasan

    I have doubts on this system..Their press release says that their software is used to identify the various writing scripts (and not languages). That could skew the results because a lot of people I know write their language using the English alphabet..

  • http://techcrunchies.com Anand Srinivasan
  • http://techcrunchies.com/tweet-language-break-up/ Language Share On Twitter — Tech Crunchies – Internet Statistics and Numbers

    [...] Here are the top five languages on Twitter and the share of tweets they represent [...]

  • http://rizkiharit.wordpress.com Rizki Harit

    +1 :)

    Saya harap twitter menambahkan fitur penerjemah

    (You can use Google translater if you don’t understand Bahasa)

  • http://huguesval.com/blog/2010/02/lol-la-nouvelle-barbie-sera-geek-et-utilisera-linux/ [lol] La nouvelle Barbie sera Geek et utilisera Linux | Hugues Valentin

    [...] : TechCrunch et [...]

  • http://clair.ro/flow/2010/02/25/daily-digest-for-february-25th/ Flow » Blog Archive » Daily Digest for February 25th – The zeitgeist daily

    [...] Shared Only 50% Of Twitter Messages Are In English. [...]

  • http://amanz.my/2010/02/bahasa-melayu-merupakan-bahasa-ke-empat-di-twitter/ Bahasa Melayu Merupakan Bahasa Ke-Empat Di Twitter | aMaNz

    [...] Bahasa Melayu Merupakan Bahasa Ke-Empat Di Twitter Web 2.0 Tiada Komen 25 Feb 2010Melalui suatu kajian ringkas, ianya didapati, Bahasa Melayu merupakan antara bahasa yang paling banyak digunakan dihalaman [...]

  • http://babeliaformacion.wordpress.com/2010/02/25/el-50-de-los-mensajes-de-twitter-estan-en-ingles/ El 50% de los mensajes de Twitter están en inglés « El blog de Babelia Formación

    [...] Leave a Comment  Si aún estáis con dudas sobre qué idioma se habla en Internet, el último estudio de Twitter os ha sacado de ellas. En una de las redes sociales más utilizadas del mundo el idioma [...]

  • http://blog.mohdisa.com/ mohdisa

    wo…malay language in 4th place. very nice. :)

  • http://www.technologygear.net/out-of-all-twitter-messages-only-50-are-in-english.html Out of All Twitter Messages Only 50% Are in English

    [...] in their local language in these places. The recent Twitter statistics study show that only about 50% of the messages and updates are in English which there are all other languages of the world taking various slots in Twitter update frequent [...]

  • http://dupola.wordpress.com/2010/02/26/links-for-2010-02-25/ links for 2010-02-25 « dupola's Collections

    [...] Only 50% Of Twitter Messages Are In English, Study Says (tags: TwitterData) [...]

  • http://www.idnforums.com/forums/25319-only-50%25-of-twitter-messages-are-in-english-study-says.html#post155925 Only 50% Of Twitter Messages Are In English, Study Says – IDN Forums – Internationalised Domain Names

    [...] 50% Of Twitter Messages Are In English, Study Says http://techcrunch.com/2010/02/24/twitter-languages/ __________________ Join the first IDN Domains Group on [...]

  • http://socialmediaatwork.com/2010/02/25/top-5-language-on-twitter-are-english-japanese-portuguese-malay-and-spanish/ Top 5 language on Twitter are English, Japanese, Portuguese, Malay and Spanish « Social Media at Work

    [...] More at TechCrunch. [...]

  • http://deathlock.wordpress.com/ Xaliber von Reginhild

    I’m also Indonesian and with all the Rana, Marsha, and “LO RETWEET” cases… I feel ashamed. As if Indonesia has no hi-quality tweets besides #IndonesiaUnite (and that thing is still blatantly meaningless).

    *facepalm*

  • http://dailysocial.net/2010/02/27/bahasa-melayu-terbesar-ke-4-di-twitter/ Bahasa Melayu Terbesar ke-4 di Twitter | DailySocial

    [...] akhirnya makin tak terbantahkan dengan dirilisnya laporan dari Semiocast seperti yang ditulis di Techcrunch, Bahasa Melayu merupakan bahasa ke-4 terbesar yang ada di Twitter. 6% dari pengguna Twitter [...]

  • http://zaryl.com zaryl

    haha.. bravo!

  • http://www.creativetraction.com/2010/02/28/the-future-of-social-media-monitoring/ The future of social media monitoring | Creative Traction

    [...] past week a report surfaced that only 50% of tweets are in English. The other 4 top languages are Japanese, Portuguese, Malay & Spanish. The release of this [...]

  • http://www.mchiche.com/la-nouvelle-barbie-sera-geek-et-utilisera-linux/ La nouvelle Barbie sera Geek et utilisera Linux |

    [...] : TechCrunch [...]

  • http://www.darkmatter101.org/site/2010/03/31/%ef%bb%bfdigitizing-race-lisa-nakamura/ Digitizing Race – Lisa Nakamura | darkmatter Journal

    [...] United States: From the 1960s to the 1980s, Routledge (1986), p.69 [↑]3.  A recent study indicates that English is no longer the majority language of Twitter. [↑]4.  For [...]

  • http://gorumors.com/crunchies/tweet-language-break-up/ Language Share On Twitter — Tech Crunchies

    [...] Here are the top five languages on Twitter and the share of tweets they represent [...]

  • http://www.iscamedia.com/index.php/2010/04/filmmakings-just-a-matter-of-ratio/ Filmmaking’s just a matter of ratio | isca media

    [...] other day, we tweeted out that only 50% of tweets are in English (Techcrunch). There’s more to this if we just analyse it for a moment. Considering that the [...]

  • Drew

    You should also try this new Chrome extenssion that will translate your feeds from Twitter and Facebook: Social Translate https://chrome.google.com/extensions/detail/giljl...

blog comments powered by Disqus
Advertisement
Got a tip? Building a startup? Tell us