Data Engineer

  • United States
  • K1x

Full-Time (Exempt)

Fully Remote Position

Preferred Locations: Central Time Zone or Eastern Time Zone

Who We Are:

We are K1X. Our technology is used by the nation’s largest institutional investors, funds, and accounting firms, by bringing long-established solutions that are creating an all-digital K-1 experience. Our goal is to transform the K-1 industry by moving a traditionally PDF-based process to an all-digital experience via our software solutions. Join us at the start of something exciting!

What Are We Looking For?

We are seeking a highly skilled and experienced Staff Data Engineer to join our dynamic team. The ideal candidate will be comfortable working across various data engineering tasks, from building and optimizing data pipelines to designing and implementing scalable data infrastructure. As a Staff Data Engineer, you will play a crucial role in supporting our machine learning models and ensuring that our systems are robust, efficient, and scalable. 

With K1X You Will:

  • Design, build, and maintain scalable and efficient data pipelines to support machine learning models and analytics. 
  • Collaborate with data scientists, software engineers, and other stakeholders to understand data needs and deliver appropriate solutions. 
  • Implement best practices for data governance, data quality, and data lifecycle management 
  • Mentor and guide junior team members, fostering a collaborative and innovative work environment. 

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field 
  • 6+ years of relevant industry experience as a data engineer, with a focus on unstructured and semi-structured data (e.g. financial documents) 
  • Experience with various data storage solutions (SQL, NoSQL, data lakes, data warehouses, ...) as well as cloud platform offerings and vendor solutions (for example, Azure Cosmos, GCP BigQuery, DataBricks, ...) 
  • Excellent problem-solving skills with the ability to synthesize and communicate complex technical results to senior leaders and nontechnical audiences 
  • Proficiency in Python and familiarity with machine learning frameworks and libraries (e.g. scikit-learn, PyTorch) 

Preferred Experience: 

  • Previous experience with applications of NLP to financial documents 
  • Familiarity with alternative investment accounting needs 
  • Prior experience managing a moderate-sized (300k+ documents) training corpus for language models 

Benefits

·       Unlimited Vacation Policy + Sick Time + Holidays

·       Paid Parental Leave

·       Fully Remote Opportunity

·       Healthcare Benefits and 401K

· Growing Startup Culture