An elegant PyTorch deep reinforcement learning library.
A probabilistic programming library for Bayesian deep learning, generative models, based on Tensorflow
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions through 34 interactive tasks