MCI helps customers take on their CX and DX challenges differently, creating industry-leading solutions that deliver exceptional experiences and drive optimal performance. MCI was named by Inc. Magazine as Iowa's Fastest Growing Company in the State of Iowa. MCI employs 10,000+ talented individuals with 150+ diverse North American client partners.
AI Systems Administrator
Location
Canada
Posted
42 days ago
Salary
0
Seniority
Mid Level
No structured requirement data.
Job Description
AI Systems Administrator
The Sydney Call Centre
Role Description We are seeking a technically skilled, proactive, and detail-oriented AI Systems Administrator to support, maintain, and optimize the infrastructure powering our artificial intelligence and machine learning environments. This role ensures the reliability, scalability, and security of AI systems, models, and related data pipelines bridging the gap between technical operations and advanced AI innovation. Key Responsibilities: - System Management: - Oversee, configure, monitor AI and ML systems, servers, and cloud environments to ensure optimal performance and uptime. - Manage GPU/CPU clusters and ensure efficient resource allocation for training and inference workloads. - Infrastructure Optimization: - Implement and maintain scalable infrastructure to support large language models (LLMs), data processing pipelines, and model deployment. - Optimize system performance through tuning, automation, and proactive maintenance. - Security & Compliance: - Apply best practices for securing AI systems, ensuring data integrity, confidentiality and compliance with company and industry standards. - Manage user access, permissions, and security configurations across AI platforms. - Deployment & Integration: - Support the deployment and integration of AI models and APIs into production environments. - Collaborate with developers, data scientists, and prompt engineers to ensure seamless system functionality and workflow automation. - Monitoring & Troubleshooting: - Monitor system health, usage, and performance metrics; diagnose and resolve infrastructure or software issues. - Maintain logs, conduct root cause analysis, and implement corrective actions to prevent recurrence. - Automation & Scripting: - Develop scripts and tools to automate system tasks, data transfers, and performance checks. - Support CI/CD pipelines for AI model updates and system maintenance. - Documentation & Support: - Create and maintain detailed documentation of system configurations, procedures, and troubleshooting guides. - Provide technical support to AI teams, ensuring smooth operation of all AI systems and tools. - Research & Continuous Improvement: - Stay up to date with advancements in AI infrastructure, cloud technologies, and MLOps practices. - Recommend and implement improvements to enhance system reliability and scalability. Qualifications - Bachelor’s degree in Computer Science, Information Technology, Data Engineering, or a related field. - 2+ years of experience in systems administration, DevOps, or infrastructure management (AI/ML environment experience preferred). - Strong understanding of cloud platforms (AWS, Azure, GCP) and containerization technologies (Docker, Kubernetes). - Experience with Linux/Unix administration, Python/Bash scripting, and automation tools (Terraform, Ansible, Jenkins). - Familiarity with machine learning frameworks (TensorFlow, PyTorch) and AI model deployment pipelines. - Understanding of networking, security, and storage in distributed computing environments. - Experience with GPU-based computing and performance optimization for AI workloads. - Excellent problem-solving, troubleshooting, and documentation skills. - Strong collaboration and communication abilities to work with cross-functional AI and engineering teams. Requirements - Must be authorized to work in the country where the job is based. - Must be willing to submit up to a LEVEL II background and/or security investigation with a fingerprint. - Must be willing to submit to drug screening. (Does not apply in Canada) Benefits - Paid Time Off: Earn PTO and paid holidays to take the time you need. - Health Benefits: Full-time employees are eligible for supplemental health coverage through Blue Cross. - Life Insurance: Access life insurance options to safeguard your loved ones. - Supplemental Insurance: Accident and critical illness insurance. - Career Growth: With a focus on internal promotions, employees enjoy significant advancement opportunities. - Paid Training: Learn new skills while earning a paycheck. - Fun, Engaging Work Environment: Enjoy a team-oriented culture that fosters collaboration and engagement. - Casual Dress Code: Be comfortable while you work.
Related Guides
Related Categories
Related Job Pages
More System Administrator Jobs
• Provide responsive customer support via phone, email, and other channels. • Manage and maintain strong relationships with assigned customer accounts. • Accurately record all customer interactions, investigations, and resolutions. • Monitor system dashboards and security alerts, taking prompt action when needed. • Recreate and troubleshoot reported issues using internal guides and best practices. • Collaborate with internal teams to resolve complex technical problems. • Assist with application roll-outs and updates to ensure smooth implementation. • Identify and resolve system errors to maintain optimal performance. • Support configuration change projects and ensure accurate implementation. • Apply release scripts and perform testing to validate configuration changes. • Complete routine administrative tasks to support daily operations. • Manage communications through shared mailboxes efficiently. • Maintain up-to-date technical and procedural documentation. • Provide out-of-hours support when required to ensure service continuity.
• Take technical ownership of assigned clients, including escalated issues, ensuring resolution of infrastructure- and business-critical problems. • Perform advanced support and troubleshooting for escalated incidents, including network, server, cloud, business application, and workstation issues. • Evaluate client infrastructure, systems, software, and processes to identify deficiencies and recommend improvements or projects to meet client needs. • Lead major incident outages, conduct post-incident reviews, and drive corrective actions, including root cause analysis and documentation. • Identify systemic and pervasive issues, perform trend analysis, and implement permanent solutions to prevent future problems. • Design and validate multi-tenant solutions, including identity management, networking, security, disaster recovery, zero trust, hybrid AD/Entra, Intune baselines, conditional access, identity governance, complex networking/firewall policies, SASE/SWG, and SIEM integrations. • Benchmark, tune, and optimize systems for performance, capacity, and scalability. • Create and maintain documentation, network diagrams, change control approvals, and knowledge base content. • Develop and maintain automation and operational tools, including PowerShell modules, RMM policies, deployment pipelines, compliance checks, and monitoring improvements. • Deliver training, mentor junior engineers, and contribute to team readiness and process improvement initiatives.
Systems Administrator
HireHawkSave up to 80% on payroll with fully vetted global contractors—compliant and productive from day one.
• Implementing and administering Zoho CRM and Zoho Recruit to support business operations. • Building workflow automations and leveraging Zoho APIs for process efficiency. • Configuring custom modules and fields within Zoho to meet business requirements. • Integrating Zoho with other platforms using webhooks or iPaaS tools. • Creating and maintaining comprehensive technical and process documentation. • Supporting end-users through ticketing systems and delivering training for business systems.
Senior Systems Administrator – Power Platform, Azure
PingWindPingWind is CVE-certified and a service-disabled-veteran-owned small business (SDVOSB) helping federal government clients increase the security and performance
• Administer and maintain Azure and Power Platform environments, ensuring high availability, performance, and compliance. • Configure, monitor, and optimize resources including Logic Apps, Functions, and Power Automate flows. • Support the design and implementation of CI/CD pipelines (GitHub Actions, Jenkins, Azure DevOps) for secure and efficient deployments. • Apply and maintain automation frameworks (Power Platform, Azure Automation) to reduce manual intervention and improve operational consistency. • Implement and oversee security monitoring, incident response, and compliance reporting aligned with federal O&M standards. • Utilize observability tools to track performance, identify anomalies, and ensure service continuity. • Collaborate with developers, engineers, and cybersecurity personnel to support system integration and modernization initiatives. • Participate actively in Agile Scrum meetings and manage individual tickets through Rally or similar tools, ensuring clear and timely communication across development, engineering, and cybersecurity teams. • Assist with AI agent orchestration and integration of intelligent automation technologies (e.g., Copilot Studio) into system management workflows. • Participate in routine system audits, patching, and updates to maintain compliance with organizational and regulatory requirements. • Maintain system documentation and contribute to continuous improvement of processes and procedures.



