Position

Infrastructure Engineer

Location: Prague and Argentina
Starts: As soon as possible
Status: Full Time

We are seeking an infrastructure engineer to be a key member of a world-class data pipeline team. If elegant design, copy-by-value semantics and generic programming operating at scale appeal to you, then we should talk.

Our engineering team works on a greenfield data processing pipeline that leverages modular, composable, idiomatic C++1x and Python to keep things simple and efficient. We haven’t ruled out more functional solutions like Haskell yet; we are just missing someone like you to convince us.

Our engineering team collects and analyzes data on the evolving security posture/state of the Internet. The problems we tackle are global in scale and must be handled in an efficient and timely manner. A key challenge in our environment is keeping solutions simple and composable so that we can reason about them at scale. That discipline is a hallmark of functional programming, high-performance computing and large-scale distributed systems design.

Ideal Attributes

  • Informed point of view on analysis, design, and troubleshooting of large-scale distributed systems. This starts with developing a deep understanding of the problem.
  • Preference for minimal, simple and highly composable design
  • Strategies to avoid cloud service provider lock-in
  • Reasoned approach to black-box analysis and troubleshooting

Responsibilities

  • Analysis, design and troubleshooting of our data distribution and processing architecture
  • Key developer of Bluepipe, our native processing pipeline
  • Primary advocate for tool introduction and removal
  • Key party in system selection and configuration
  • Clear, well-articulated communication (drawings / presentations / writeups) with engineering, operations and product management staff
  • Mentorship of junior and mid-career engineers

Tools

  • Data definition, format and interfaces
      • Definitions – Protobuf v3
      • Normalize from – Avro / JSON / XML / CSV
      • Normalize to – Protobuf / ORC
      • Interfaces – REST API(s), gRPC and object store buckets
  • Databases – Postgres / Presto
  • Languages – Python / C++14 / Scala / Go
  • Job Orchestration – HTCondor / Apache Airflow
  • Analytics – Spark / Databricks / Bluepipe (native)
  • Storage – Gluster / NFS / Object Stores
  • Computation – Containers / VMs / Metal

Interested?

Send an email to

teptalent@idealunchbox.io

Make sure your email includes the following:

  • The job title in the subject line
  • One paragraph with three bullet points explaining why you’re qualified for this position
  • Your resume, attached