Major Responsibilities:
-
- Monitors and ensures the integrity, quality, and security of genomic data stored in the data warehouse.
- Develops, tests, and debugs moderately complex SQL and routine R or Python code to deliver reproduceable analysis ready datasets. Documents processes and rationales with code. Provides data dictionaries and descriptive statistics with datasets as appropriate.
- Maintains documentation for data workflows, pipelines, and warehouse architecture.
- Troubleshoots data pipeline issues and optimize performance.
- Stays current with industry best practices and emerging technologies in genomics and data engineering.
- Assists with defining functional requirements, formulating technical specifications, and researching and evaluating alternatives for moderately complex custom applications and processes that support research.
- Participates in data mapping and cleaning activities. Makes recommendations and improves data quality within research data infrastructure.
- Ensures datasets are accessible and provided to research investigators and stakeholders in a manner consistent with Health Insurance Portability and Accountability Act (HIPAA), Institutional Review Board (IRB), legal agreements, policies and procedures, and other appropriate standards.
Licensure, Registration, and/or Certification Required:
- Epic Caboodle Data Model certification issued by Epic. needs to be obtained within 6 months, and
- Epic Clarity Data Model certification issued by Epic (CLR110). needs to be obtained within 6 months, and
- Epic Clinical Data Model certification issued by Epic (COG240). needs to be obtained within 6 months.
Education Required:
- Bachelor's Degree in Computer Science or related field.
Experience Required:
Knowledge, Skills & Abilities Required:
-
- Knowledge of genomic data formats (e.g., VCF, FASTQ), genomic data management, and genomic research methods
- Experience with SQL, R programming languages. Python and Bash preferred.
- Familiarity with genomic analysis tools such as SAMtools, BEDtools, DESeq2
- Ability to incorporate new technical and analytical skills into current capabilities.
- Ability to think analytically, logically, and use creativity.
- Ability to work independently, or as part of a team and balance multiple priorities.
#Remote
#LIRemote
Preferred remote locations in IL, WI, NC, GA
Fully Remote Role from these states: AL, AK, AR, AZ, DE, FL, GA, IA, ID, IL, IN, LA, KS, KY, ME, MI, MO, MS, MT, NC, ND, NE, NH, NM, NV, OH, OK, PA, SC, SD, TN, TX, UT, VA, WI, WV, WY.
Due to complex requirements, remote work is NOT permitted for short or long periods in: CA, CO, CT, HI, MA, MD, MN, NJ, NY, OR, RI, VT, WA and working Internationally (this includes working while on vacation).
No relocation, No Sponsorship or transfer of visa for this position.
Physical Requirements and Working Conditions:
- Position may require travel which may result in exposure to road and weather hazards.
- Exposed to normal office environment.
- Operates all equipment necessary to perform the job.
This job description indicates the general nature and level of work expected of the incumbent. It is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities required of the incumbent. Incumbent may be required to perform other related duties.