Data Engineer - Data Platform (Berkeley, California or US Remote)

Company: UrbanFootprint
Location: Berkeley
Posted on: May 7, 2022

Job Description:

UrbanFootprint is the world's first Urban Intelligence Platform. We provide critical intelligence to the institutions that are rebuilding the world's infrastructure. Where does the energy sector invest in electrification, decarbonization, and asset hardening in the face of climate threats? Where do cities and businesses invest to catch up with e-commerce, last-mile delivery, and new mobility? Where do governments deploy relief and new infrastructure to combat record hunger, homelessness, and hazard vulnerability?

UrbanFootprint is 'Google Maps for the Modern Enterprise.' We organize, normalize, and align thousands of urban, climate, and community metrics across 120 million US land parcels. The platform delivers targeted insights via dynamic data streams and elegant collaborative web mapping applications. We enable our customers to answer complex questions in minutes rather than weeks, months, or years.

Our customers include some of the largest energy utilities, major financial institutions, critical government agencies, top urban planning firms, and fast-growing mobility companies. We're growing rapidly in a market with a TAM of $22B, and our competition is old-world manual consulting or outdated software tools that come without the data, the models, or the insights.

Our founders, Joe DiStefano and Peter Calthorpe, are urban planning pioneers who have spent decades providing critical urban intelligence to cities and enterprises across the globe. UrbanFootprint was named one of the World's Most Innovative Companies in 2021 and is on the GovTech 100 list. Our platform was awarded the top spot in FastCo's Innovation by Design competition.

The role:

As a socially-driven Senior Data Engineer, you will be building UrbanFootprint's foundational data infrastructure. These internal systems and tools enable Data Scientists to develop our proprietary models and algorithms that address wide-ranging, real-world problems. By ensuring that data is accurate, accessible, and up-to-date, this platform powers models that allow our customers to design environmentally conscious growth plans, target government relief dollars, and promote equitable community development.

You are an enthusiastic enabler; you love knowing that your tools empower others to thrive. You know that company growth and success mean enabling others to unblock themselves by building systems that meet internal user needs, crafting discoverable documentation, and building scalable infrastructure that is easy to use.

As our Data Engineer on Data Platform, you're thoughtful and pragmatic; you work with others to build for the business needs of today and participate in design discussions for the future. You understand that data quality doesn't mean 'there's no missing data'; it means understanding that missing data happens. You know that you need to work with downstream consumers on how and when to address it.

You are autonomous, not independent; you incorporate feedback from engineers and end-users, clearly communicate progress, and care deeply about the reliability of the infrastructure and the accuracy of the data you produce.

What you'll do:

- Collaborate with data engineers and our Head Architect to implement our internal data infrastructure, focusing on data storage and versioning, data discoverability, and data access.
- Take responsibility for the accuracy and integrity of our data, and build automated systems and processes to ensure our systems stay current with the latest public data.
- Collaborate with Data Scientists to define, monitor, and automate data quality metrics reporting.
- Support team members in building a world-class "full stack" data science culture, and support data scientist and solution analyst independence.

Your background most likely includes:

- Work experience equivalent to an M.S. in Computer Science/Engineering or Data Engineering.
- 2 years of relevant industry experience, including building and maintaining production ETLs.
- Proficiency in at least one SQL dialect, Python, and at least one scalable data analytics framework such as Dask, PySpark, or Apache Beam.
- Experience with Airflow.
- Experience with Kubernetes.

Bonus qualifications:

- Data engineering or machine learning-related certifications such as GCP Professional Data Engineer or Machine Learning Engineer, Databricks Certified Associate Developer for Apache Spark, or similar.
- Hands-on experience with the GCP platform, including GKE, Dataproc, Cloud Composer, and/or Vertex AI.
- Experience working closely with data or machine learning scientists.
- Experience with geospatial or spatio-temporal data, including both raster and vector data.
- Experience with open-source geospatial data such as OpenStreetMap and US Census.
- You are socially driven to leverage data to facilitate a more equitable and resilient society.

UrbanFootprint is committed to diversity in its workforce. We are committed to equal employment opportunity regardless of race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other legally protected class.

