This is a graph of the frequency of Tweets by character length (1 to 140 characters) from the 500 million Tweet sample used by Scott Golder and Vladimir Barash (sample extracted from Twitter in late 2009) in their academic papers. View this graph in comparison to a Twitter employee’s set of graphs that describe the typical length of a Tweet. Current Tweets have a number of changes (different text processing in the twitter-text API, wrapping of URLs, different application distributions, etc.) Note that the early peak seen in the current Twitter graphs is much less pronounced. Another note is that this graph shifts a little depending on how the Tweets are processed. More on that some other time.Thanks to Scott Golder and Vladimir Barash for letting me use this data. 

This is a graph of the frequency of Tweets by character length (1 to 140 characters) from the 500 million Tweet sample used by Scott Golder and Vladimir Barash (sample extracted from Twitter in late 2009) in their academic papers. View this graph in comparison to a Twitter employee’s set of graphs that describe the typical length of a TweetCurrent Tweets have a number of changes (different text processing in the twitter-text API, wrapping of URLs, different application distributions, etc.) Note that the early peak seen in the current Twitter graphs is much less pronounced. Another note is that this graph shifts a little depending on how the Tweets are processed. More on that some other time.

Thanks to 
Scott Golder and Vladimir Barash for letting me use this data. 

Blog comments powered by Disqus