LeadStack Inc. is an award-winning, certified minority-owned (MBE) staffing
services provider and one of the nation’s fastest-growing suppliers of
contingent workforce. As a recognized industry leader in contingent workforce
solutions and certified as a Great Place to Work, we’re proud to partner with
some of the most admired Fortune 500 brands in the world.
**Title: Python Developer with Machine Learning**
Location: San Francisco, CA (Fully Remote)
Duration: 6-12 month contract
Direct Banking Client
Immediate interview
**Job Description:**
Building from the success of the Sepsis initiative thus far, *** Diagnostics
is creating a data analytics and science team to generate novel insights and
real world evidence (RWE) from real world data (RWD) that contain lab results
generated by DHR Dx OpCo instruments linked to patient medical records. In
this role, you will be responsible for driving technical execution and
helping us create a best-in-class data & analytics organization. You will work
with various stakeholders both inside and outside the organization to execute
on our research initiatives.
**Responsibilities:**
• Collaborate with stakeholders to understand data requirements for ML,
Data Science and Analytics projects.
• Assemble large, complex data sets from disparate sources, writing code,
scripts, and queries as appropriate to efficiently extract, QC, clean,
harmonize, and visualize Big Data sets.
• Develop software tools for data engineering tasks following industry
standard practices such as agile, test-driven development, and CI/CD and
deliver them to internal customers.
• Write pipelines for optimal extraction, transformation, and loading of data
from a wide variety of data sources using Python, SQL, Spark, AWS, and Azure
‘big data’ technologies.
• Identify, design, and implement continuous process improvements such as
automating manual processes and optimizing data delivery.
• Document data processes, write data management recommended procedures, and
create training materials relating to data management best practices.
**Required Qualifications:**
• Expertise in Python with strong knowledge of Python best practices,
particularly as they apply to the area of data & analytics, and experience
using standard data science toolkits
• Previous experience performing data engineering tasks on RWD/RWE projects
involving Electronic Medical Records (EMR) data
• Experience with defining clinical protocols to collect RWD
• Advanced SQL knowledge, including query authoring, and experience working
with relational databases, as well as working familiarity with a variety of
databases.
• Familiarity with healthcare data standards, data ontologies, toolchains, and
operating procedures
• An associate who is independent, self-motivated, and eager to excel in a
goal-oriented and multi-faceted work environment.
• Someone who embraces uncertainty and thrives by driving to clarity in a
fast-paced, ambiguous environment
• Excellent written and verbal communication skills and the ability to clearly
articulate project goals, timelines, key milestones, and accomplishments
to stakeholders.
**Desired Qualifications:**
• Experience with digital health, AI/ML algorithms, Clinical Decision Support
(CDS) products
• Experience with developing software solutions on public cloud
infrastructures, particularly Amazon AWS and Microsoft Azure
• Knowledge of security and privacy requirements and certifications, including
HIPAA, GDPR, ISO 9001, ISO 27001
• Experience with a variety of standard Python data visualization methods
• Knowledge of cluster computing and large-scale data analytics platforms such
as Apache Spark
• Knowledge of shared data science development and analytics environments such
as JupyterLab.
To know more about current opportunities at LeadStack, please visit us at
https://leadstackinc.com/careers/
Should you have any questions, feel free to call me or send an email to
suman.mishra@leadstackinc.com
**Suman Mishra**
Team Lead
C. 650-850-8282
A. 611 Gateway Blvd, Ste 120, South San Francisco, CA 94080
W. www.leadstackinc.com