Skip to content

Twitter Text Libraries. This code is used at Twitter to tokenize and parse text to meet the expectations for what can be used on the platform.


Notifications You must be signed in to change notification settings


Folders and files

Last commit message
Last commit date

Latest commit



This repository is a collection of libraries and conformance tests to standardize parsing of Tweet text. It synchronizes development, testing, creating issues, and pull requests for twitter-text's implementations and specification. These libraries are responsible for determining the quantity of characters in a Tweet and identifying and linking any URL, @username, #hashtag, or $cashtag.

See implementations and conformance in this repository below:

Other language implementations

The following implementations exist in other programming languages, but are not supported by or used by Twitter. We'd like to thank the authors for building and maintaining these alternatives.

If you would like to contribute a link to other implementations, please consider sending a Pull Request, or letting us know via the Twitter Developer Community forums.

Copyright and License

Copyright 2012-2020 Twitter, Inc and other contributors

Licensed under the Apache License, Version 2.0: