• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..--

.github/workflows/18-Oct-2025-4027

data/18-Oct-2025-50,00450,003

src/18-Oct-2025-288214

tests/18-Oct-2025-2923

.editorconfigD18-Oct-2025337 1914

.gitignoreD18-Oct-202527 32

.php-cs-fixer.dist.phpD18-Oct-2025212 1411

CHANGELOG.mdD18-Oct-2025327 1610

LICENSED18-Oct-20251 KiB2217

README.mdD18-Oct-2025625 2617

composer.jsonD18-Oct-20251.1 KiB4746

composer.lockD18-Oct-2025136.5 KiB3,9243,923

phpstan.neonD18-Oct-202566 65

rector.phpD18-Oct-2025569 2518

README.md

1# TikToken clone for PHP - PHP GPT3 tokenizer
2
3PHP Text Tokenizer for GPT models
4
5## About
6
7A PHP toolkit to tokenize text like GPT family of models process it.
8
9Forked from [semji/gpt3-tokenizer-php](https://github.com/semji/gpt3-tokenizer-php) to bug fixes and improvement.
10
11## Requirements
12* PHP 8.1
13* mbstring extension [details here on how to install mbstring](https://www.php.net/manual/en/mbstring.installation.php)
14## Usage
15
16```php
17use TikToken\Encoder;
18$prompt = "Ai is cool";
19$encoder = new Encoder();
20
21$tokens = $encoder->encode($prompt); // [32, 72, 318, 3608]
22
23// Get tokens count:
24echo count($tokens); // 4
25```
26