Search

Senior Staff Engineer, Memory/Network Systems

NuByt, Inc
locationSan Jose, CA, USA
PublishedPublished: 6/14/2022
Technology
Full Time

Job Description

Job Description

We are seeking a highly experienced Senior Staff Engineer to design and develop next-generation systems software spanning Linux, Kubernetes, memory technologies, and distributed infrastructure. This role requires deep expertise across the software stack—from Linux kernel and host operating systems to Kubernetes orchestration and cloud-native platform services.

The ideal candidate is a hands-on engineer who can work across boundaries traditionally divided between kernel, systems, and cloud platform teams.


Key Responsibilities

  • Architect and implement end-to-end systems software for advanced platform and memory technologies across Linux and Kubernetes environments.
  • Design and develop solutions spanning:
  • Linux kernel and host OS integration
  • User-space system services and node agents
  • Container runtime and kubelet integration
  • Kubernetes APIs, Controllers, CRDs, and orchestration workflows
  • Develop software for resource discovery, reservation, activation, resizing, release, rollback, and reconciliation within clustered environments.
  • Build and maintain Kubernetes Operators, Controllers, and automation services using Go, client-go, controller-runtime, and related frameworks.
  • Collaborate with Linux, firmware, and hardware engineering teams to translate low-level platform capabilities into Kubernetes-manageable workflows.
  • Work on Linux-side integration involving memory lifecycle management, memory hotplug, NUMA awareness, cgroups, device/resource management, and system telemetry.
  • Define interfaces between host software and cluster control-plane components, including failure handling, recovery, and state management.
  • Develop node-local software and control-plane services that coordinate host state, platform services, and Kubernetes resources.
  • Drive architecture around reconciliation, idempotency, concurrency control, and fault recovery across distributed systems.
  • Create observability solutions for controller behavior, node readiness, resource lifecycle management, and failure analysis.
  • Support system bring-up, debugging, validation, and performance tuning in lab and pre-production environments.


Required Qualifications

  • Strong software engineering experience in Linux systems programming.
  • Deep understanding of Kubernetes internals, controllers, operators, CRDs, and cluster orchestration.
  • Proficiency in Go (Golang).
  • Experience with client-go, controller-runtime, and Kubernetes Operator development.
  • Strong knowledge of Linux kernel concepts, memory management, NUMA, cgroups, and system-level resource management.
  • Experience developing distributed systems and control-plane software.
  • Familiarity with container runtimes, kubelet interactions, and cloud-native infrastructure.
  • Experience designing highly reliable, fault-tolerant, and observable systems.
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...