ocr-phi-masking

databricks-industry-solutions

Our joint Solution Accelerator with John Snow Labs automates the detection of sensitive information contained within unstructured data using NLP models for healthcare. Extracted data is stored within the Lakehouse, where teams can use the pre-trained models to easily remove, obfuscate or mask data for downstream analytics at massive scale.
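
The difference between removing, masking, and obfuscating a detected value can be illustrated with a minimal sketch. This is not the accelerator's code: the accelerator relies on pre-trained John Snow Labs NLP models, while the example below swaps in two simple regular expressions as a stand-in detector. The `PATTERNS` table, the `redact` function, and the `mode` names are hypothetical, and "obfuscate" is simplified here to replacing digits with zeros.

```python
import re

# Hypothetical stand-in detector for illustration only. The accelerator uses
# pre-trained NLP models to find sensitive entities, not regular expressions.
PATTERNS = {
    "DATE": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}

MODES = ("remove", "mask", "obfuscate")


def redact(text, mode="mask"):
    """Return text with every detected value removed, masked, or obfuscated."""
    if mode not in MODES:
        raise ValueError(f"mode must be one of {MODES}, got {mode!r}")

    def make_replacer(label):
        def replacer(match):
            if mode == "remove":
                # Drop the value entirely.
                return ""
            if mode == "mask":
                # Replace the value with its entity label.
                return f"<{label}>"
            # Obfuscate (simplified): keep the shape, replace each digit with 0.
            return re.sub(r"\d", "0", match.group())
        return replacer

    for label, pattern in PATTERNS.items():
        text = pattern.sub(make_replacer(label), text)
    return text


note = "Seen on 2021-03-04, call 555-123-4567."
print(redact(note, "mask"))
print(redact(note, "obfuscate"))
print(redact(note, "remove"))
```

Masking keeps a visible label so downstream users know what kind of value was present, while removal leaves no trace of it. Which mode fits depends on what the downstream analytics need.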

Stars: 7
Forks: 5
Watchers: 7
Language: Python
License: Other
Cost to Build: $3.0K
Market Value: $3.2K

Growth over time (stars, forks, watchers): 3 data points, 2022-11-01 to 2025-04-01

How to clone ocr-phi-masking

Clone via HTTPS

git clone https://github.com/databricks-industry-solutions/ocr-phi-masking.git

Clone via SSH

git clone git@github.com:databricks-industry-solutions/ocr-phi-masking.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the ocr-phi-masking issue tracker:

Open GitHub Issues