Steemit's adoption in the non-English-speaking world

I was analyzing last week's top posts by number of views when I noticed something peculiar.

bug.png

Do you see it?

There are several posts on the leaderboard with funny-looking identifiers. So I went to investigate, and after visiting the posts in question, I found out they were all in Korean. The identifiers look malformed because the slugify algorithm used by Steemit filters out all non-ASCII characters. Korean (Hangul) characters fall outside the ASCII range, so little or nothing of the original title survives in the slug.
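To illustrate the effect, here is a minimal sketch of an ASCII-only slugifier. It is not Steemit's actual code; the function name `ascii_slugify` and its exact rules are assumptions made for the example. It only shows how dropping non-ASCII characters leaves a Korean title with almost nothing to build an identifier from.

```python
import re


def ascii_slugify(title: str) -> str:
    """Hypothetical ASCII-only slugifier (an assumption, not Steemit's code)."""
    # Drop every character outside the ASCII range; Hangul is removed entirely.
    ascii_only = title.encode("ascii", errors="ignore").decode("ascii")
    # Collapse each run of non-alphanumeric characters into a single hyphen.
    slug = re.sub(r"[^a-z0-9]+", "-", ascii_only.lower())
    # Trim hyphens left over at either end.
    return slug.strip("-")


print(ascii_slugify("Hello World"))       # hello-world
print(ascii_slugify("스팀잇 소개 2016"))    # 2016
print(repr(ascii_slugify("안녕하세요")))     # ''
```

A slugifier that transliterates or keeps Unicode letters would avoid the problem, at the cost of longer or percent-encoded URLs.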

This sparked my curiosity further. I wanted to know how many different languages are being used on Steemit and, by implication (albeit not accurately), how many national communities we have here.

Results

I collected all the posts from the last 7 days (only posts with votes and comments were included) and ran their contents through Google's language detection algorithm. Here are the results.
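The tallying step can be sketched roughly as follows. This is not the script used for the chart: `language_shares` is a hypothetical helper, and `toy_detect` is a deliberately crude stand-in for a real language detector (it only separates Hangul from everything else), included so the example runs without external dependencies.

```python
from collections import Counter
from typing import Callable, Dict, Iterable


def language_shares(posts: Iterable[str],
                    detect: Callable[[str], str]) -> Dict[str, float]:
    """Return each detected language's share of posts, in percent."""
    counts = Counter(detect(body) for body in posts)
    total = sum(counts.values())
    if total == 0:
        return {}
    # most_common() orders languages from largest to smallest share.
    return {lang: round(100 * n / total, 1) for lang, n in counts.most_common()}


def toy_detect(text: str) -> str:
    """Toy stand-in detector (assumption): 'ko' if any Hangul syllable, else 'en'."""
    if any("\uac00" <= ch <= "\ud7a3" for ch in text):
        return "ko"
    return "en"


sample = ["Hello Steemit", "안녕하세요", "Another English post", "Good morning"]
print(language_shares(sample, toy_detect))  # {'en': 75.0, 'ko': 25.0}
```

Passing the detector in as a function keeps the counting logic independent of whichever detection service or library is actually used.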

newplot.png

It looks like the Korean-speaking community is the second largest on Steemit, with a 5.2% 'market share', followed by Spanish, German, Indonesian, Croatian and Polish.

Homework for the curious reader:
Try to find out how many people in the world speak these languages, and re-normalize the percentages accordingly.
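One possible way to do the re-normalization, as a sketch: divide each language's share of posts by its number of speakers, then rescale so the results sum to 100. The helper name `per_speaker_shares` is hypothetical, and the numbers below are made-up placeholders for two fictional languages, not real measurements. Looking up the real speaker counts remains the homework.

```python
from typing import Dict


def per_speaker_shares(post_share: Dict[str, float],
                       speakers_millions: Dict[str, float]) -> Dict[str, float]:
    """Re-normalize post shares by speaker population (result sums to about 100)."""
    # Posts share per million speakers, for each language.
    density = {lang: post_share[lang] / speakers_millions[lang]
               for lang in post_share}
    total = sum(density.values())
    return {lang: round(100 * d / total, 1) for lang, d in density.items()}


# Placeholder inputs (assumptions for illustration only).
post_share = {"lang_a": 80.0, "lang_b": 20.0}   # percent of posts
speakers = {"lang_a": 400.0, "lang_b": 50.0}    # millions of speakers

print(per_speaker_shares(post_share, speakers))
# {'lang_a': 33.3, 'lang_b': 66.7}
```

In this toy case the smaller language ends up with the larger adjusted share, because it produces more posts per speaker.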

Thanks

I have also noticed that we have several 'ambassadors' who are collaborating with whales and otherwise supporting the growth of non-English communities. Thank you for your dedicated efforts. I hope more people will take the initiative as you did, to make Steemit a truly global system.
