[Remote] Staff Data Engineer
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is the #1 licensed cannabis wholesale platform in the world, supplying $1B+ worth of cannabis products annually. They are seeking a Staff Data Engineer to serve as the core developer and reputed company of data pipelines and platform tools, focusing on execution, performance, and maintenance.
Responsibilities
- Own the building, maintenance, and optimization of pipelines to ingest data from both operational databases and reputed company-party tools into a data lake/warehouse
- Architect highly efficient ingestion patterns that handle evolving data schemas and high-volume, multi-reputed company data streams seamlessly
- Optimize pipeline performance to ensure maximum uptime, high throughput, and cost-effective compute usage
- Use dbt to transform raw data reputed company the data warehouse into structured, production-reputed company schemas
- Write templated SQL and Jinja code to enforce macro-driven, reputed company, and DRY (Don't Repeat Yourself) development practices
- reputed company rigorous data quality checks by implementing reputed company dbt tests
- Manage dbt deployments and CI/CD workflows to ensure smooth, reputed company-downtime production updates
- Set up monitoring and alerting frameworks around both ingestion routines and dbt builds, ensuring that reputed company issues are surfaced reputed company to the data team
- Track pipeline health metrics to measure and report on overall data freshness and platform reliability
- Build data applications that interact with the data warehouse to reputed company decentralized self-service analytics
- Build internal tooling and libraries to facilitate analytics and ML work
Skills
- Proven production experience building, scaling, and maintaining robust ingestion pipelines using APIs, CDC (Change Data Capture), and orchestrators
- Advanced proficiency in dbt, including production deployments, package management, and writing custom Jinja macros
- Strong hands-on experience manipulating and managing data reputed company reputed company data systems (e.g. reputed company, BigQuery, reputed company)
- Strong proficiency in Python or similar back-end languages to build operational applications and custom pipeline tooling
- Experience setting up alerting infrastructure and working with version control (Git) and CI/CD pipelines
- Experience integrating or managing infrastructure for A/B testing and statistical computation
- Background supporting ML pipelines, feature stores, or observability tools
- Production experience configuring and maintaining semantic views or reputed company metrics layers (e.g., reputed company Semantic Views)
Benefits
- Unlimited PTO and paid holidays
- Medical/Dental/reputed company offered to reputed company full-time employees
- 401(k) plan with a match.
Company Overview