Forward Deployed Data Engineer

Engineering · Full-time · United States · Remote possible

Job description

About Sayari:  Sayari is the counterparty and supply chain risk intelligence provider trusted by government agencies, multinational corporations, and financial institutions. Its intuitive network analysis platform surfaces hidden risk through integrated corporate ownership, supply chain, trade transaction and risk intelligence data from over 250 jurisdictions. Sayari is headquartered in Washington, D.C., and its solutions are used by thousands of frontline analysts in over 35 countries.

Our company culture is defined by a dedication to our mission of using open data to enhance visibility into global commercial and financial networks, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.

Positions Description: Sayari’s flagship product, Sayari Graph, provides instant access to structured business information from hundreds of millions of corporate, legal, and trade records. As a member of Sayari's data team you will work with our Product and Software Engineering teams to collect data from around the globe, maintain existing ETL pipelines, and develop new pipelines that power Sayari Graph. 

Job Responsibilities:

  • Working directly with our clients to help them ETL their data into a format which is usable by Sayari’s on-premise offering
  • Working with customers pre-sales to help them design solutions focused around Sayari’s product offerings for Entity Resolution and bulk data
  • Working with customers post-sale to ensure that they are getting value from Sayari’s bulk data product
  • Managing the process of producing customized bulk data products for customers and bulk data samples for prospective customers

Required Skills & Experience:

  • Professional experience with Python and a JVM language (e.g., Scala)
  • 4+ years of experience designing and maintaining ETL pipelines
  • Experience using Apache Spark
  • Experience with SQL (e.g., Postgres) and NoSQL databases (e.g., Cassandra, ElasticSearch, etc.)
  • Experience working on a cloud platform like GCP, AWS, or Azure
  • Experience working collaboratively with git

Desired Skills & Experience:

  • Experience working directly with clients of your company; not solely working with internal stakeholders
  • Understanding of Docker/Kubernetes
  • Understanding of or interest in knowledge graphs
  • Experienced in supporting and working with internal teams and customers in a dynamic environment
  • Passionate about open source development and innovative technology
A panel showing how The Org can help with contacting the right person.

Open roles at Sayari