Bespoke Technologies, Inc.

Data Engineer

Herndon/Chantilly, VA - Full Time

BT-66 – Data Engineer
Skill Level: Expert
Location: Herndon/Chantilly (fully on-site, no remote option)
 

**MUST HAVE A POLY CLEARANCE TO APPLY**



Develop new tools, code, and services to execute data engineering activities involving data of varying types and in varying conditions. Activities include the following tasks:
  • Move structured and unstructured data using approved methods
  • Execute data ingestion activities for storing data in a local or enterprise-level location
  • Develop code to format data in a way that supports exploration
  • Analyze source data formats and work with Data Scientists and partners to determine the formats and transforms that best meet objectives
  • Develop code and tools to provide one-time and ongoing data extraction from various repositories, with formatting and transformation into enterprise or standalone data models
  • Develop new ETL and perform O&M and enhancements on existing ETL code using best practices/standards
  • Develop and deliver documentation for each project, including ETL mappings, code use guide, code location, and access instructions
  • Design and optimize Data Pipelines using tools such as Spark, Apache Iceberg, Trino, OpenSearch, EMR cloud services, NiFi and Kubernetes containers
  • Ensure the pedigree and provenance of the data are maintained so that access to the data is protected
  • Clean and preprocess data to enable access for advanced analytics
  • Collaborate with enterprise working groups to advance the state of data standards
  • Collaborate with the engineering team, data stewards, and partners to aid in getting actionable value out of the data holdings
  • Collaborate with software engineers to update, configure, and maintain data services based on the requirements
  • Ensure data quality by working with the testing and data quality team to enhance standardization of data conditioning pipelines 
  • Adapt to various types and formats of data, and work with development teams to integrate new data processing platforms
Required Skills:
10+ years' experience with:
  • Data lifecycle engineering
  • Development and maintenance of extract, transform and load (ETL) tools and services
  • Cloud and on-prem data storage and processing solutions
  • Python, SQL, Spark and other data engineering programming
  • COTS and open-source data engineering tools such as Elasticsearch and NiFi
  • Processing data within the Agile Lifecycle 