Senior Software Engineer
Cloudera empowers people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world's largest enterprises.
Cloudera is a leader in the fast-growing big data platforms market. This is a rare chance to make a name for yourself in the industry and in the Open Source world. The candidate will be responsible for Apache Hive and CDW projects.
As a Senior Software Engineer you will:
- Build robust and scalable data infrastructure software
- Design and create services and system architecture for your projects
- Improve code quality through writing unit tests, automation, and code reviews
- Write Java code and/or build several services in the Cloudera Data Platform
- Work with a team of engineers who reviewed each other's code/designs and held each other to an extremely high bar for the quality of code/designs
- Understand the basics of Kubernetes
- Build out the production and test infrastructure
- Develop automation frameworks to reproduce issues and prevent regressions
- Work closely with other developers providing services to our system
- Help to analyze and to understand how customers use the product and improve it where necessary
We are excited if you have:
- Deep familiarity with Java programming language
- Hands-on experience with distributed systems
- Knowledge of database concepts, RDBMS internals
- Experience working in a distributed team
- 3+ years of experience in software development
- Deep knowledge of distributed systems, query optimization, and columnar storage formats (Parquet, ORC)
You might also have:
- Experience in open source development and knowledge of Git, JIRA and Jenkins etc
- Experience with containerised environments
- Experience with the Hadoop ecosystem is a great plus
- Experience with distributed file systems / databases is a plus
- Familiarity with the internals of RDBMS, SQL, JDBC is a plus
- Experience with cloud infrastructure is a great plus
Why this role matters:
This is your opportunity to build cloud-native solutions that are deployable anywhere, whether in massive clusters on any cloud provider or in private data centers. You'll work with cutting-edge technologies like Trino, Spark, Airflow, and advanced AI inferencing systems to shape the future of analytics. Your code will directly influence how data engineers, analysts, and developers worldwide find value in their data. We believe in the power of open source. You'll collaborate with project committers, contributing upstream to keep technologies like Apache Hive and Impala evolving. You'll harden these engines for rock-solid security, optimize them for peak performance, and make them effortlessly run across all environments. Join us and help build the trusted, cloud-native platform that powers insights for the most data-intensive companies on the planet.
What you can expect from us:
- Generous PTO Policy
- Support work life balance with Unplugged Days
- Flexible WFH Policy
- Mental & Physical Wellness programs
- Phone and Internet Reimbursement program
- Access to Continued Career Development
- Comprehensive Benefits and Competitive Packages
- Paid Volunteer Time
- Employee Resource Groups