Software Engineer - Lakeflow PhD Candidates
Company: Databricks Inc.
Location: Mountain View
Posted on: May 17, 2025
|
|
Job Description:
Job Description: Data Engineer - Lakeflow Team at
DatabricksDatabricks is transforming the data lifecycle, from
ingestion to generative AI, across multiple clouds with a unified
platform. We serve over 10,000 customers, processing exabytes of
data daily on more than 15 million VMs, and our growth continues
rapidly.The Lakeflow team is seeking recent PhD graduates to work
on products like Apache Spark Structured Streaming, Delta Live
Tables (DLT), and Materialized Views. Structured Streaming is one
of the world's leading streaming engines. DLT simplifies the
development and management of reliable batch and streaming data
pipelines, providing high-quality data on the Databricks Lakehouse
Platform. It streamlines ETL processes with declarative pipeline
development, automatic data testing, and comprehensive monitoring
and recovery features. DLT also enhances pipeline execution through
logical and physical optimizations, including instance type
selection and autoscaling.Additionally, we have developed Enzyme, a
new catalyst optimization layer designed to accelerate ETL
processes by enabling incremental computation and materialization
of intermediate results. Enzyme maintains up-to-date
materializations of query results stored in Delta tables by
employing a cost model that chooses between various optimization
techniques inspired by traditional materialized view maintenance,
delta streaming, and common ETL patterns.As part of the Lakeflow
DLT team, you'll have opportunities to innovate in areas such
as:
#J-18808-Ljbffr
Keywords: Databricks Inc., Berkeley , Software Engineer - Lakeflow PhD Candidates, IT / Software / Systems , Mountain View, California
Click
here to apply!
|