Name | Date | Size | #Lines | LOC | ||
---|---|---|---|---|---|---|
.. | - | - | ||||
data/ | 02-Apr-2025 | - | 50,004 | 50,003 | ||
src/ | 02-Apr-2025 | - | 288 | 214 | ||
tests/ | 02-Apr-2025 | - | 29 | 23 | ||
.editorconfig | D | 01-Apr-2025 | 337 | 19 | 14 | |
.php-cs-fixer.dist.php | D | 01-Apr-2025 | 212 | 14 | 11 | |
CHANGELOG.md | D | 01-Apr-2025 | 327 | 16 | 10 | |
LICENSE | D | 01-Apr-2025 | 1 KiB | 22 | 17 | |
README.md | D | 01-Apr-2025 | 625 | 26 | 17 | |
composer.json | D | 01-Apr-2025 | 1.1 KiB | 47 | 46 | |
composer.lock | D | 01-Apr-2025 | 136.5 KiB | 3,924 | 3,923 | |
phpstan.neon | D | 01-Apr-2025 | 66 | 6 | 5 | |
rector.php | D | 01-Apr-2025 | 569 | 25 | 18 |
README.md
1# TikToken clone for PHP - PHP GPT3 tokenizer 2 3PHP Text Tokenizer for GPT models 4 5## About 6 7A PHP toolkit to tokenize text like GPT family of models process it. 8 9Forked from [semji/gpt3-tokenizer-php](https://github.com/semji/gpt3-tokenizer-php) to bug fixes and improvement. 10 11## Requirements 12* PHP 8.1 13* mbstring extension [details here on how to install mbstring](https://www.php.net/manual/en/mbstring.installation.php) 14## Usage 15 16```php 17use TikToken\Encoder; 18$prompt = "Ai is cool"; 19$encoder = new Encoder(); 20 21$tokens = $encoder->encode($prompt); // [32, 72, 318, 3608] 22 23// Get tokens count: 24echo count($tokens); // 4 25``` 26