See all roles

[Remote] Network Engineer - Network Resiliency and High Availability

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Dice is seeking a Senior Network Engineer specializing in Network Resiliency and High Availability to ensure their global network infrastructure remains fault-tolerant and capable of seamless disaster recovery. The role involves designing, validating, and optimizing redundant paths and high-availability clusters while ensuring zero packet loss for business-critical applications during unforeseen failures.

Responsibilities

  • Design, implement, and maintain high-availability network topologies using physical and logical redundancy patterns (e.g., Multi-Chassis EtherChannel/MCLAG, VPC, and VSS)
  • Architect redundant Wide Area Network (WAN) transport paths utilizing dual-homed ISP connections, SD-WAN dynamic path selection, and automated failover technologies
  • Conduct controlled Network Chaos Engineering exercises (e.g., simulating fiber cuts, device power failures, and split-brain scenarios) to validate failover timers and resilience assumptions
  • Optimize enterprise routing protocols (BGP, OSPF, EIGRP) for ultra-fast convergence, tuning features like Bidirectional Forwarding Detection (BFD), Fast Reroute (FRR), and Graceful Restart
  • Implement First Hop Redundancy Protocols (HSRP, VRRP, GLBP) to guarantee default gateway redundancy for end-user and server segments
  • Manage complex traffic engineering strategies (e.g., BGP local preference, AS-path prepending) to ensure predictable asymmetric/symmetric routing during failure states
  • Lead the network engineering track for Corporate Disaster Recovery planning, including active-active and active-passive data center strategies
  • Design, configure, and maintain automated DNS-based failover (GSLB) and Anycast routing strategies to reroute user traffic away from degraded data centers or cloud regions
  • Keep comprehensive, up-to-date documentation on failover runbooks and infrastructure dependency maps
  • Deploy advanced monitoring tools to track metrics like Mean Time to Detect (MTTD) and Mean Time to Repair (MTTR)
  • Set up telemetry-based alerting (SNMP, gRPC/Streaming Telemetry) to identify gray failures (e.g., high interface error rates causing intermittent drops) before they cause total outages

Skills

  • 5+ years in a dedicated network engineering or operations role, with a proven track record of designing 99.99% or 99.999% (Four-to-Five Nines) uptime environments
  • Bachelor's degree in Computer Science, Computer Engineering, or equivalent practical experience
  • Design, implement, and maintain high-availability network topologies using physical and logical redundancy patterns (e.g., Multi-Chassis EtherChannel/MCLAG, VPC, and VSS)
  • Architect redundant Wide Area Network (WAN) transport paths utilizing dual-homed ISP connections, SD-WAN dynamic path selection, and automated failover technologies
  • Conduct controlled Network Chaos Engineering exercises (e.g., simulating fiber cuts, device power failures, and split-brain scenarios) to validate failover timers and resilience assumptions
  • Optimize enterprise routing protocols (BGP, OSPF, EIGRP) for ultra-fast convergence, tuning features like Bidirectional Forwarding Detection (BFD), Fast Reroute (FRR), and Graceful Restart
  • Implement First Hop Redundancy Protocols (HSRP, VRRP, GLBP) to guarantee default gateway redundancy for end-user and server segments
  • Manage complex traffic engineering strategies (e.g., BGP local preference, AS-path prepending) to ensure predictable asymmetric/symmetric routing during failure states
  • Lead the network engineering track for Corporate Disaster Recovery planning, including active-active and active-passive data center strategies
  • Design, configure, and maintain automated DNS-based failover (GSLB) and Anycast routing strategies to reroute user traffic away from degraded data centers or cloud regions
  • Keep comprehensive, up-to-date documentation on failover runbooks and infrastructure dependency maps
  • Deploy advanced monitoring tools to track metrics like Mean Time to Detect (MTTD) and Mean Time to Repair (MTTR)
  • Set up telemetry-based alerting (SNMP, gRPC/Streaming Telemetry) to identify gray failures (e.g., high interface error rates causing intermittent drops) before they cause total outages
  • Cisco Certified Internetwork Expert (CCIE - Enterprise Infrastructure or Data Center) or strong CCNP with equivalent experience
  • Juniper Networks Certified Internetworking Specialist/Expert (JNCIS/JNCIE)
  • Certified Business Continuity Professional (CBCP) or equivalent familiarity with DR frameworks is a plus

Company Overview

  • Dice is the go-to career marketplace for tech professionals. It was founded in 2010, and is headquartered in Drachten, Friesland, NLD, with a workforce of 201-500 employees. Its website is https://www.or-quest.nl/.
  • Company H1B Sponsorship

  • Dice has a track record of offering H1B sponsorships, with 2 in 2022, 4 in 2021, 5 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might like

    [Remote] Network Engineer - Multi-Cloud Connectivity Architecture

    Work from home Full-time role

    [Remote] Physical AI Field Applications Engineer (UR / MiR, Bay Area, CA Remote)

    Work from home Full-time role

    [Remote] Adobe Analytics Consultant

    Work from home Full-time role

    [Remote] Medicare Cost Report Auditor III

    Work from home Full-time role

    [Remote] Senior Manager, SEO/GEO

    Work from home Full-time role

    [Remote] Product Manager, Provider Flex Solutions

    Work from home Full-time role

    [Remote] SEO Website Developer

    Work from home Full-time role

    [Remote] Senior Platform Engineer

    Work from home Full-time role

    [Remote] Enterprise Account Executive

    Work from home Full-time role

    [Remote] IT Account Manager

    Work from home Full-time role

    Google Cloud Architect

    Work from home Full-time role

    Delta Airlines Work at Home Jobs - Entry Level

    Work from home Full-time role

    Remote Registered Nurse with compact license

    Work from home Full-time role

    Part-Time, Administrative Assistant/Receptionist – Amazon Store

    Work from home Full-time role

    Experienced Pharmacy Technician - Data Entry Specialist for Remote Patient Support and Clinical Services

    Work from home Full-time role

    Experienced Cloud Customer Engineer for Google Work From Home Opportunities - Full Time Remote Position with Competitive Salary and Benefits

    Work from home Full-time role

    PPC Consultant – Ongoing, Part-Time

    Work from home Full-time role

    Experienced Remote Data Entry Professional – Part-Time Opportunity for Career Growth and Development with blithequark

    Work from home Full-time role

    Customer Service Representative - Remote - Full Time - Paid Training - Medical Industry - Day Shift - Competitive Salary

    Work from home Full-time role

    Hiring Now: Require Leadership & Management within Complex

    Work from home Full-time role