Project Description
A federal project focused on providing high-level expertise in AI/ML, data integration, data exchange, and data management services for large volumes of data. We will establish the data governance strategy and data management practices, and implement the technology required for an established web-based system to process clinical data more efficiently, improving the quality of healthcare and patient safety in hospitals. The project also involves providing subject matter expertise and advisory services on managing the data lifecycle in healthcare delivery and identifying opportunities for advanced analytics.
Responsibilities
- Develop, train, and deploy OCR models using AWS or equivalent cloud services to extract text from electronic health records in PDF format.
- Optimize OCR accuracy for complex medical terminology and extract entities of interest from various document formats.
- Implement data preprocessing and enhancement techniques to improve OCR performance.
- Design and implement large language model (LLM) applications to interpret and answer questions related to patient health based on extracted text from medical records.
- Continuously fine-tune LLMs to improve the accuracy of their answers.
- Utilize services such as Amazon Textract, Amazon Comprehend Medical, and Amazon SageMaker (or equivalent) for developing and deploying AI/ML models.
- Ensure scalable and efficient cloud-based solutions for processing large volumes of medical documents.
- Work closely with healthcare professionals and software engineers to understand project requirements and deliver effective solutions.
- Communicate technical concepts and progress to non-technical stakeholders.