📊 Token Count (tc) 🦀

A simple and efficient token count program written in Rust! 🚀

📝 Description

This Rust implementation of the classic wc (word count) command-line tool allows you to count lines, words, characters, and even tokens in text files or from standard input. It's fast, reliable, and supports Unicode! 🌍✨

🎯 Features

Count lines 📏
Count words 🔤
Count characters (including multi-byte Unicode characters) 🔡
Count tokens using various tokenizer models 🔢
Process multiple files 📚
Read from standard input 🖥️
Supports various languages (English, Korean, Japanese, and more!) 🌐

🛠️ Installation

There are two ways to install tc:

Option 1: Install from source

Make sure you have Rust installed on your system. If not, get it from rust-lang.org 🦀

Clone this repository:

git clone https://github.com/guuzaa/tc.git
cd tc

Build the project:
```
cargo build --release
```
The executable will be available at target/release/tc

Option 2: Install pre-built binaries

Go to the Releases page of the tc repository.
Download the latest release for your operating system and architecture.
Extract the downloaded archive.
Move the tc executable to a directory in your system's PATH (e.g., /usr/local/bin on Unix-like systems).
You can now use tc from anywhere in your terminal!

🚀 Usage

Options:

-l, --lines: Show line count 📏
-w, --words: Show word count 🔤
-c, --chars: Show character count 🔡
-t, --tokens: Show token count 🔢
--model <MODEL>: Choose tokenizer model (default: gpt3)

Available models:

gpt3: r50k_base
edit: p50k_edit
code: p50k_base
chatgpt: cl100k_base
gpt4o: o200k_base

If no options are specified, all counts (lines, words, characters, and tokens) will be shown.

Examples:

Count lines, words, and characters in a file:
```
tc example.txt
```
Count only words in multiple files:
```
tc -w file1.txt file2.txt file3.txt
```
Count lines and characters from standard input:
```
echo "Hello, World!" | tc -lc
```
Count tokens using the ChatGPT tokenizer:
```
tc -t --model chatgpt example.txt
```
Count everything in files with different languages:
```
tc english.txt korean.txt japanese.txt
```

🤝 Contributing

Contributions are welcome! Feel free to submit issues or pull requests. 🎉

📜 License

This project is licensed under the MIT License. See the LICENSE file for details. 📄

🙏 Acknowledgements

The Rust community for their amazing tools and support 🦀❤️
The original Unix wc command for inspiration 🖥️
The editor Cursor 🤖

Happy counting! 🎉📊🚀

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
.vscode		.vscode
docs		docs
locales		locales
src		src
tests		tests
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📊 Token Count (tc) 🦀

📝 Description

🎯 Features

🛠️ Installation

Option 1: Install from source

Option 2: Install pre-built binaries

🚀 Usage

Options:

Examples:

🤝 Contributing

📜 License

🙏 Acknowledgements

About

Releases 4

Languages

License

guuzaa/tc

Folders and files

Latest commit

History

Repository files navigation

📊 Token Count (tc) 🦀

📝 Description

🎯 Features

🛠️ Installation

Option 1: Install from source

Option 2: Install pre-built binaries

🚀 Usage

Options:

Examples:

🤝 Contributing

📜 License

🙏 Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 4

Languages