Job Description
Summary
PeepalCo is the brand name for our Group entity and will house all our wealth-tech brands. The largest of our brands, CoinSwitch, will be housed under PeepalCo, so will our forays into Indian Equities and Mutual Funds.
What You Will Do:
- Design, construct, and optimize scalable data pipelines using Python or Java, and leverage Spark (SparkSQL) within the Airflow scheduler/executor framework to meet business and product needs.
- Develop and maintain robust data warehouses and lakes on AWS, ensuring architecture supports efficient data retrieval and storage.
- Author complex SQL queries and design data models to optimize for performance and scalability, addressing product and business requirements.
- Implement real-time data processing pipelines within a micro-services architecture, ensuring timely data availability and integrity.
- Collaborate with cross-functional teams (Data Science, Product, Business) to support data infrastructure and analytics initiatives.
- Advocate for and implement best practices in data engineering, including Agile, TDD (Test-Driven Development), and CI/CD (Continuous Integration/Continuous Deployment) to enhance team productivity and product quality.
- Stay abreast of emerging data technologies and evaluate their application to continually improve the data ecosystem within the company.
What You Should Have:
- 3+ years of data engineering experience.
- Strong programming skills in Python or Java, with a proven track record of solving complex data engineering problems.
- Working knowledge of relational databases, with expertise in SQL query authoring and data model design for optimal storage and retrieval.
- Able to implement ETL processes and frameworks using DBT.
- Experienced in building scalable data pipelines using Spark (SparkSQL) and managing workflows with Airflow or similar tools.
- Proficient in developing real-time data pipelines within a micro-services architecture.
- Hands-on experience with AWS data services (e.g., S3, GLUE, EMR) or equivalent technologies in the Apache ecosystem (e.g., Spark, Flink, Hive, Kafka).
- Demonstrated expertise in using Redshift/Snowflake for data warehousing solutions, including data storage, processing, and analysis within Snowflakes environment.
- Solid understanding of best practices in software development, including Agile methodologies, TDD, and CI/CD processes.
- Familiarity with modern data storage formats (e.g., Hudi, Iceberg, Delta Lake) is highly desirable.
Life at PeepalCo:
We take great pride in what we do, and are committed to our mission. And we have a lot of fun while at it!
Heres how we do things at PeepalCo:
-
Customer-first: Thats the North Star. Everything we do is to make our users investment experience better and simplified.
-
Ownership: We dont sport lab coats, but we experimenta lot. And we take ownership. We even have a catchphrase for this: Think big, fail fast, and build better.
-
Data-driven: The source of truth. Simple as that.
-
Fun: PS5, anyone? Or do you prefer Foosball? Or perhaps Carrom? And yes, our HR team has a whole list of activities: Disco nights, offsites, gift boxes, and more!
Speaking of lists, the perks and benefits are so extensive, this space isnt enough. Here are a few:
-
Parenthood: Up to 8 months of Maternity leave and 1 month of Paternity leave
-
Gender Reassignment Surgery: Be the best version of you! Well support you and reimburse your medical bill.
Skills
- AWS
- Database Management
- Development
- Problem Solving
- Python
- Software Engineering
- SQL