Data Scientist 2

at Pacific Northwest National Laboratory in Little Rock, Arkansas, United States

Job Description

Overview

The Physical and Computational Sciences Directorate (PCSD) researchers lead major R&D efforts in experimental and theoretical interfacial chemistry, chemical analysis, high energy physics, interfacial catalysis, multifunctional materials, and integrated high-performance and data-intensive computing.

PCSD is PNNL’s primary steward for research supported by the Department of Energy’s Offices of Basic Energy Sciences, Advanced Scientific Computing Research, and Nuclear Physics, all within the Department of Energy’s Office of Science.

Additionally, Directorate staff perform research and development for private industry and other government agencies, such as the Department of Defense and NASA. The Directorate’s researchers are members of interdisciplinary teams tackling challenges of national importance that cut across all missions of the Department of Energy.

Responsibilities

The successful candidate will have the opportunity to support the creation of, and access to, scientific data generated from computational chemistry and cheminformatics studies that advance drug discovery, molecular science, and materials science research, among others. The candidate will apply their data science and data engineering skills to provide autonomous design, development, and support of web-facing applications tailored to the needs of these research users. They will contribute to the design and development of tools that simplify access to and use of the chemical data, and will be expected to work both independently and within a team to support software design and development activities, including requirements analysis, code development, testing, documentation, and deployment support.

Designs, develops, and implements methods, processes, and systems to analyze diverse data. Applies knowledge of statistics, machine learning, advanced mathematics, simulation, software development, and data modeling to integrate and clean data, recognize patterns, address uncertainty, pose questions, and make discoveries from structured and/or unstructured data. Produces solutions driven by exploratory data analysis from complex and high-dimensional datasets. Designs, develops, and evaluates predictive models and advanced algorithms that lead to optimal value extraction from the data. Demonstrates ability to transfer skills across application domains.

Qualifications

Minimum Qualifications:

+ BS/BA and 2 years of relevant experience OR

+ MS/MA OR

+ PhD

Preferred Qualifications:

The Scientist is expected to have strong technical writing skills and be able to present complex ideas related to chemical data analysis and development at project reviews and conferences. The candidate will also support collaborative interaction through written communications, demonstrations, and presentations about technical activities related to computational chemistry and data science. The candidate will work collaboratively within a team to execute the full system development lifecycle, including analyzing user needs to determine technical requirements for chemical data analysis and visualization; developing technical specifications based on conceptual design and requirements; and identifying and evaluating new technologies or methods in data science, machine learning, and high-performance computing for implementation and continuous improvement of computational chemistry and cheminformatics workflows.

+ Extensive expertise in applying machine learning algorithms and packages to computational chemistry and cheminformatics problems, including but not limited to regression and classification algorithms, supervised/unsupervised learning techniques, Random Forest, SVM, and various neural networks.

+ Proficient in using machine learning and deep learning frameworks and tools such as scikit-learn, MATLAB, Theano, Torch, and TensorFlow for developing predictive models and analyzing chemical data.

+ Solid experience working in Unix/Linux environments and High-Performance Computing and cloud computing settings, with a focus on running molecular modeling software packages.

+ Proficiency in high-level programming languages like Python, R, and MATLAB, with a strong emphasis on scientific computing libraries (e.g., NumPy, SciPy, Pandas) and cheminformatics toolkits (e.g., RDKit, Open Babel, ChemoPy) for processing and analyzing chemical data.

+ Strong background and hands-on experience in autonomous science, AI/ML, data science, and Natural Language Processing, along with a solid foundation in chemistry, materials science, cheminformatics, and computational chemistry/biology. Skilled in applying these techniques to accelerate drug discovery, materials design, and reaction prediction.

+ Proficient in using cheminformatics databases and tools for managing, querying, and analyzing large chemical datasets. Skilled in applying data mining and machine learning techniques to extract meaningful insights and build predictive models from these datasets.

+ Some grasp of software engineering concepts, including design patterns, data structures, and algorithm optimization, with a focus on developing efficient and maintainable code for computational chemistry and cheminformatics workflows. Skilled in applying these concepts to develop high-performance software tools for molecular modeling, virtual screening, and QSAR/QSPR analysis.

+ Strong understanding of statistical methods and data visualization techniques for analyzing and interpreting computational chemistry results.

+ Excellent communication and collaboration skills, with the ability to work effectively in a multidisciplinary team of computational chemists, cheminformaticians, data scientists, and software engineers. Experienced in writing scientific papers, presenting research findings at conferences, and communicating complex technical concepts to both technical and non-technical audiences.

Hazardous Working Conditions/Environment

Not applicable

Additional Information

Not applicable

Testing Designated Position

This is not a Testing Designated Position (TDP)

About PNNL

Pacific Northwest National Laboratory (PNNL) is a world-class research institution powered by a highly educated, diverse workforce committed to the values of Integrity, Creativity, Collaboration, Impact, and Courage. Every year, scores of dynamic, driven people come to PNNL to work with renowned researchers on meaningful science, innovations and outcomes for the U.S. Department of Energy and other sponsors; here is your chance to be one of them!

At PNNL, you will find an exciting research environment and excellent benefits including health insurance, flexible work schedules and telework options. PNNL is located in eastern Washington State, the dry side of Washington, known for its stellar outdoor recreation and affordable cost of living. The Lab’s campus is only a 45-minute flight (or ~3-hour drive) from Seattle or Portland, and is served by the convenient PSC airport, connected to 8 major hubs.

Commitment to Excellence, Diversity, Equity, Inclusion, and Equal Employment Opportunity

Our laboratory is committed to a diverse and inclusive work environment dedicated to solving critical challenges in fundamental sciences, national security, and energy resiliency. We are proud to be an Equal Employment Opportunity and Affirmative Action employer. In support of this commitment, we encourage people of all racial/ethnic identities, women, veterans, and individuals with disabilities to apply for employment.

Pacific Northwest National Laboratory considers all applicants for employment without regard to race, religion, color, sex (including pregnancy, sexual orientation, and gender identity), national origin, age, disability, genetic information (including family medical history), protected veteran status, and any other status or characteristic protected by federal, state, and/or local laws.

We are committed to p

Job Posting: JC263618294

Posted On: Aug 03, 2024

Updated On: Aug 08, 2024
