Work with technology teams across infrastructure and other divisions to deliver system solutions for the business.
Take part in the development and integration of relevant systems with CDP / CDSW.
Apply hands-on experience in Big Data solutions and development on a Python technology stack to design and implement secure, scalable and high-performance data processing pipelines.
Document the design, build and implementation deliverables you own, and present them when needed.
Build strong relationships and manage expectations with users and stakeholders.
Requirements
At least 3 years’ relevant experience.
Strong understanding of the Cloudera framework, including CDP (Cloudera Data Platform) and CDSW (Cloudera Data Science Workbench).
Experience implementing data security and access control using Ranger/Atlas.
Must have a working understanding of Hadoop, Hive and Spark/PySpark architecture.
Experience in Python web-application frameworks (Django).
Experience in shell scripting, automation and troubleshooting technical issues.
Other good-to-have skill sets, including knowledge of global markets products:
Experience working in the financial industry, with relevant experience in core data science on the Cloudera platform using a Python technology stack.
Previous experience on data science projects and an understanding of global markets products and their underlying pricing components (market data analytics and identifying risk factors affecting the pricing of the products).
Experience using Cloudera Manager to monitor Hadoop services.
Good knowledge of and working experience in Hadoop administration (incl. Hive, Impala, Kafka, ZooKeeper, etc.).
Hands-on techno-functional experience analysing and proposing solutions for business issues, process changes and functional requirements.
Strong team player with excellent communication and interpersonal skills.
Strong problem solver who can question and understand proposed solutions and business drivers.