Posted
Ref: RP388-325
Job description / Role
Responsibilities
• Reports To (Title): Line Manager Last Updated: 02-Jan-2024
• Solid background on software development with strong python coding skills and solve challenging problems.
• Developing Data pipelines with Cloud Services & On-premise Data Centers.
• Web crawling, data cleaning, data annotation, data ingestion and data processing.
• Reading and collating complex data sets.
• Creating and maintaining data pipelines.
• Continual focus on process improvement to drive efficiency and productivity within the team.
• Use of Python, SQL, ES, Shell etc. to build the infrastructure required for optimal extraction, transformation, and loading of data.
• Provide insights into key business performance metrics by building analytical tools that utilize the data pipeline.
• Support the wider business with their data needs on an ad hoc basis.
• Open to extensive international business travel as and when required, and for extended periods.
Requirements
Qualification
• 6+ years of programming experience, solid coding skills in Python, Shell, and Java.
• Bachelor's degree in computer engineering, Computer Science, or Electrical Engineering and Computer Sciences.
• Strong practical knowledge in data processing and migration tools, such as Apache NiFi,
• Kafka, and Spark.
• Design, build, and maintain data processing with CDP(Cloudera Data Platform) Private Cloud.
• Develop and Maintain Data Workflow with Apache Airflow.
• Experience with HDFS or Similar Object Storage
• Strong Understanding about Distribute Computing and Distributed Systems
• Experience with Web crawling, cleaning.
• Experience with solution architecture, data ingestion, query optimization, data segregation, ETL, ELT, AWS, EC2, S3, SQS, lambda, Elastic Search, Redshift, CI/CD frameworks and workflows.
• Working knowledge of data platform concepts - data lake, data warehouse, ETL, big data processing (designing and supporting variety/velocity/volume), real time processing architecture for data platforms, scheduling and monitoring of ETL/ELT jobs
• PostgreSQL and programming (preferably Java, Python), proficiency in understanding data, entity relationships, structured & unstructured data, SQL and NoSQL databases.
• Knowledge of best practice in optimizing columnar and distributed data processing system and infrastructure.
• Experienced in designing and implementing dimensional modelling.
• Knowledge of machine learning and data mining techniques in one or more areas of statistical modelling, text mining and information retrieval
About the Company
Excelsior is a bespoke HR and recruitment consultancy, specialising in the Security, Facilities Management, Education and Automotive sectors. Excelsior provides a high quality, reliable and affordable solution to companies in these sectors.
Exciting opportunities and market insights will be regularly posted on this page. If you are a talented individual looking for a change, then our highly experienced Consultants are ready to match you with an exciting new career opportunity.
Data Engineer
ManpowerGroup Middle East |
Dubai | 14 Feb | |
Cloud Data Warehouse Engineer
Huxley |
Riyadh | 25 Jan | |
IT and Data Lead
Michael Page |
Saudi Arabia | 16 Feb | |
Data Strategy and Analytics Specialist
Propel Consult |
Saudi Arabia | 13 Feb | |
AI Engineer (Synthetic Image)
Excelsior Group ME |
Dubai | 12 Feb |