A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents.
An easy-to-use Python library for merging PyTorch models.
The training code behind EmuBert, the largest open-source masked language model for Australian law.
Scripts for evaluating extractive question answering models on the LegalQAEval legal question answering benchmark.