🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
❄️ Nix-based dotfiles, claude code configs, and system settings for macOS & NixOS, which makes everyday software development fun!