Data Engineer

McLean, VA
Full Time
Experienced
BT-60 – Data Engineer
Location: McLean


**MUST HAVE A POLY CLEARANCE TO APPLY**



              
Position Summary         
Looking for a Data Engineer to assist with strategic planning and oversee implementation of the cloud-based data environment.

Job Description
The team requires support developing and maintaining a cloud-based data environment to transport, store, extract, transform, and load (ETL), and disseminate data solutions. The team needs experienced support in data engineering, cloud architecture, and application development. The work includes engaging regularly with data scientists, analysts, and managers. The Data Engineer will assist with strategic planning and oversee implementation of the cloud-based data environment, to include mapping of data sources and access controls. They will develop code, data models, and documentation to standards; provide systems administration and programming support for ETL processes and data infrastructure efforts; and train and conduct knowledge transfer to team members on issues and technologies related to ETL process, on premise high capacity compute cluster, and administrative duties. The Data Engineer will coordinate with external data and platform providers to ensure the smooth functioning of the systems and data flows, and to accomplish any needed changes and coordinate with experts to assist with technical aspects required to acquire new datasets or data management technologies for inclusion in the organization's environment. They will also support the cross-domain transfer and integration of data.

Technologies/Tools        
  1. (Mandatory) Demonstrated experience serving as a technical liaison between system engineers, data engineers, data scientists, analysts, and non-technical managers and personnel.
  2. (Mandatory) Demonstrated experience with AWS cloud services, including long-term storage options, and cloud-based database services such as Databricks or Elastic MapReduce (EMR).
  3. (Mandatory) Demonstrated experience with SQL database structures and mapping between SQL databases.
  4. (Mandatory) Demonstrated experience in large-scale data migration efforts.
  5. (Mandatory) Demonstrated experience with database architecture, performance design methodologies, and system-tuning recommendations. Preference for familiarity with Glue, Hive, and Iceberg or similar
  6. (Mandatory) Demonstrated experience with Python, Bash, and Terraform
  7. (Mandatory) Demonstrated experience with DevSecOps solutions and tools
  8. (Mandatory) Demonstrated experience implementing CI/CD pipelines using industry standard process
  9. (Desired) Demonstrated experience with the organization's data environment and on-premises compute structure.
  10. (Desired) Demonstrated experience with Data Quality and Data Governance concepts and experience.
  11. (Desired) Demonstrated experience maintaining, supporting, and improving the ETL process through the implementation and standardization of data flows with Apache Nifi and other ETL tools.
  12. (Desired) Demonstrated experience with Apache Spark
Share

Apply for this position

Required*
Apply with Indeed
We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

Human Check*