Aplikuj teraz

Remote Senior Data Engineer

Varwise

Remote +6 więcej
36726 - 42234 PLN
B2B
Scala
💼 B2B

Must have

  • Spark

  • AWS

  • Linux

  • NoSQL

  • SQL

  • Kafka

  • Scala

  • Neo4j

  • Databricks

  • PySpark

  • LLM

  • GCP

  • English (Fluent)

Nice to have

  • Kinesis

  • Airflow

  • Jenkins

  • Python

  • TensorFlow

  • Parquet

  • Delta Lake

Requirements description

  • 8+ years of professional software engineering experience, with a focus on data engineering in big data environments.
  • 4+ years of experience in developing and delivering production-grade Scala based systems , familiarity with Python, and at least one other high-level programming language (e.g., Java, C++, C#).
  • Proficiency in all aspects of SDLC, from concept to running production systems
  • Proficiency using Spark (PySpark) or Tensorflow
  • Proven experience building and optimizing large-scale data pipelines using Databricks and Spark.
  • Experience participating in ETL and ML pipeline projects based on Airflow, Kubeflow, Mleap, Sagemaker or similar
  • Hands-on experience developing and deploying data solutions in a major cloud platform (AWS, GCP, or Azure).
  • Experience working with AI, LLMs, Agents, and/or Generative AI technologies, both in product applications and for development productivity
  • Database experience at large scale, both SQL and NOSQL databases like Postgresql, Cassandra, Neo4j, Neptune, or similar
  • Experience in large scale data management formats and frameworks such as Parquet ORC, Databricks / Delta Lake, Iceberg or Hudi
  • Bachelor’s degree in Computer Science or related discipline

Offer description

We are looking for Data Engineers to work remotely for an Adtech company that leverages machine learning and data science to build an identity graph that can scale to reach millions of users via brands with programmatically selected households. The work includes scaling our Big Data asset that combines billions of transaction data points including intent, conversions, first party data into an identity graph that needs to scale to a future cookie less world

We value technical excellence and you will have both resources and time to deliver world-class code.

This is a 100% remote position, You will be working with team members in NYC.

If you like solving hard and technically challenging problems, join us to use those skills here to create real-time, concurrent, globally distributed systems applications and services.

Your responsibilities

  1. Work on creating and maintaining reliable and scalable distributed data processing systems
  2. Become a core maintainer of the data lake
  3. Maintain our data lake by building searchable data sets for broader business uses
  4. Scale, troubleshoot and fix existing applications and services
  5. Own a complex set of services and applications
  6. Focus ensuring that our data pipelines run 24/7

show all (11)

Wyświetlenia: 2
Opublikowana20 dni temu
Wygasaza 28 dni
Rodzaj umowyB2B
Źródło
Logo

Podobne oferty, które mogą Cię zainteresować

Na podstawie " Remote Senior Data Engineer "