Architect Specialist - AI Infrastructure GPU

Oracle

Saudi Arabia

Ref: RP556-3346

Job description / Role

Employment: Full Time

- Design, deploy, and optimize AI infrastructure solutions in Oracle Cloud, focusing on GPU-accelerated workloads while incorporating relevant partner and open-source solutions popular in one of the following sectors: FSI, healthcare, government, or high-tech/start-ups.
- Collaborate with multi-functional technical teams to design scalable and efficient architectures for AI/ML.
- Provide technical expertise and support for GPU-based solutions, including performance tuning and benchmarking.
- Accelerate opportunities with new and existing customers through technical presentations, proofs of concept (PoCs), and solution assistance. This involves driving value and innovation while focusing on sector-specific partner and open-source solutions.
- Research and evaluate emerging technologies and standard processes in AI infrastructure and GPU computing to drive continuous improvement.
- Create clear and compelling written assets, both external publications and internal documentation, with a particular focus on our open-source and partner solutions, benchmarks, and standard processes for running AI workloads on Oracle Cloud.

Requirements

Required Skills/Experience

- 8+ years of solution engineering experience and a Bachelor's or higher degree in Computer Science, Engineering, or a related field.
- Proven experience in designing, implementing, and optimizing AI infrastructure solutions in cloud environments, with a focus on GPU workloads in one of the sectors mentioned above.
- Advanced practical knowledge of networking, data center topologies, routing and switching protocols, and Ethernet/InfiniBand.
- System-level understanding of server architecture, PCIe devices, NICs, Linux OS, and kernel drivers. Experience with DevOps/MLOps technologies such as Docker/containers and Kubernetes, and with data center compute/network/storage deployments.
- Proficiency in cloud platforms such as Oracle Cloud, AWS, Azure, or Google Cloud Platform, including services relevant to AI and GPU computing.
- Strong understanding of GPU architecture, CUDA programming, and parallel computing principles. Familiarity with AI frameworks and libraries (e.g., TensorFlow, PyTorch, scikit-learn).
- Experience in deploying AI models at scale, such as transformer-based NLP models, graph-based deep learning models, and/or geometric deep learning models.
- Experience deploying use cases such as computer vision, recommender systems, medical imaging, drug discovery, genomics, predictive analytics, financial modeling, or HPC in GPU-accelerated cloud environments.
- Excellent problem-solving skills, including the ability to resolve sophisticated technical issues in cloud-based environments, and effective communication skills, including the ability to collaborate across teams and convey technical concepts to non-technical key stakeholders.
- Certifications in cloud computing (e.g., Oracle Cloud Infrastructure Architect Associate, AWS Certified Solutions Architect, Azure Solutions Architect) and GPU computing (e.g., NVIDIA Certified Associate - AI in the Data Center) are a plus.
- Background in open-source development, with proficiency in systems engineering and coding (C/C++, Python, CUDA).
- Hands-on experience with NVIDIA systems/SDKs (e.g., Triton Inference Server), NVIDIA Networking technologies (e.g., DPU, RoCE, InfiniBand), and/or AMD GPU solutions.
- Relevant publication history and/or conference attendance. Occasional travel required for local on-site customer visits and industry events.

About the Company

Oracle offers an integrated array of applications, databases, servers, storage, and cloud technologies to empower modern business. For most companies, flexibility is critical. Oracle provides a wide choice of software, systems, and cloud deployment models - including public, on-premises, and hybrid clouds - to ensure that technology flexes to the unique needs of a business.

Oracle Cloud is a complete, integrated stack of platform, infrastructure, and application services. With advanced scalability and security, Oracle Cloud enables technical agility across the enterprise, connects people to information for clearer insights, and fosters efficiency through simplified workflows.

More than 420,000 customers across 145 countries have harnessed Oracle technology to accelerate their digital transformation.
