Job Description

Summary

PeepalCo is the brand name for our Group entity and will house all our wealth-tech brands. The largest of our brands, CoinSwitch, will be housed under PeepalCo, so will our forays into Indian Equities and Mutual Funds.

What You Will Do:

  • Design, construct, and optimize scalable data pipelines using Python or Java, and leverage Spark (SparkSQL) within the Airflow scheduler/executor framework to meet business and product needs.
  • Develop and maintain robust data warehouses and lakes on AWS, ensuring architecture supports efficient data retrieval and storage.
  • Author complex SQL queries and design data models to optimize for performance and scalability, addressing product and business requirements.
  • Implement real-time data processing pipelines within a micro-services architecture, ensuring timely data availability and integrity.
  • Collaborate with cross-functional teams (Data Science, Product, Business) to support data infrastructure and analytics initiatives.
  • Advocate for and implement best practices in data engineering, including Agile, TDD (Test-Driven Development), and CI/CD (Continuous Integration/Continuous Deployment) to enhance team productivity and product quality.
  • Stay abreast of emerging data technologies and evaluate their application to continually improve the data ecosystem within the company.

What You Should Have:

  • 3+ years of data engineering experience.
  • Strong programming skills in Python or Java, with a proven track record of solving complex data engineering problems.
  • Working knowledge of relational databases, with expertise in SQL query authoring and data model design for optimal storage and retrieval.
  • Able to implement ETL processes and frameworks using DBT.
  • Experienced in building scalable data pipelines using Spark (SparkSQL) and managing workflows with Airflow or similar tools.
  • Proficient in developing real-time data pipelines within a micro-services architecture.
  • Hands-on experience with AWS data services (e.g., S3, GLUE, EMR) or equivalent technologies in the Apache ecosystem (e.g., Spark, Flink, Hive, Kafka).
  • Demonstrated expertise in using Redshift/Snowflake for data warehousing solutions, including data storage, processing, and analysis within Snowflakes environment.
  • Solid understanding of best practices in software development, including Agile methodologies, TDD, and CI/CD processes.
  • Familiarity with modern data storage formats (e.g., Hudi, Iceberg, Delta Lake) is highly desirable.

Life at PeepalCo:

We take great pride in what we do, and are committed to our mission. And we have a lot of fun while at it!

Heres how we do things at PeepalCo:

  • Customer-first: Thats the North Star. Everything we do is to make our users investment experience better and simplified.

  • Ownership: We dont sport lab coats, but we experimenta lot. And we take ownership. We even have a catchphrase for this: Think big, fail fast, and build better.

  • Data-driven: The source of truth. Simple as that.

  • Fun: PS5, anyone? Or do you prefer Foosball? Or perhaps Carrom? And yes, our HR team has a whole list of activities: Disco nights, offsites, gift boxes, and more!

Speaking of lists, the perks and benefits are so extensive, this space isnt enough. Here are a few:

  • Parenthood: Up to 8 months of Maternity leave and 1 month of Paternity leave

  • Gender Reassignment Surgery: Be the best version of you! Well support you and reimburse your medical bill.

Skills
  • AWS
  • Database Management
  • Development
  • Problem Solving
  • Python
  • Software Engineering
  • SQL
© 2024 cryptojobs.com. All right reserved.