Core ML Engineer: Deep Learning Architecture

MECKA ASSOC

New York, United States of America

6 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Compensation

$ 250K

Job location

New York, United States of America

Tech stack

API

Artificial Intelligence

Computer Vision

Software Debugging

Machine Learning

Software Deployment

PyTorch

Deep Learning

Low Latency

Optimization Algorithms

ONNX (Open Neural Network Exchange) Format

Production Code

Machine Learning Operations

TensorRT

Job description

We're hiring an ML and Optimization Specialist to lead model architecture improvements across all of Mecka's pipelines.

This role is heavily focused on foundational deep learning engineering rather than applied ML. We are looking for an engineer who natively writes, debugs, and modifies internal model architectures from the ground up, moving beyond utilizing off-the-shelf models or standard fine-tuning.

Many of our current ML systems rely heavily on frame-by-frame models, but all of our data is inherently temporal. Your immediate focus will be converting and optimizing these models for temporal inference - a critical unlock for pipeline performance.

Beyond that, you'll be the go-to person for model-level debugging, architecture design, and optimization across the organization. This is a high-leverage, deeply technical role for someone who thinks at the architecture level., * Tune and debug ML models at the model architecture level - modifying structural code, writing custom layers, and addressing the underlying math, rather than relying solely on high-level APIs or hyperparameter tuning

Profile and optimize model performance (latency, throughput, memory)
Evaluate and introduce new architectures, training strategies, and optimization techniques
Collaborate with CV, ML, and infrastructure teams to deploy improved models

Requirements

Deep expertise in ML model architecture design and optimization
Ability to tune and debug models at the architecture level - diagnosing issues in attention mechanisms, loss landscapes, gradient flow, etc.
Strong experience with temporal/sequential models (transformers, RNNs, temporal convolutions, state-space models)
Proficiency in PyTorch (or equivalent) at a research-engineering level
Experience optimizing models for production deployment

Strong Signals

Published papers or production experience with video understanding or temporal perception
Experience with model distillation, quantization, or efficient inference
Background in computer vision model architectures
Experience converting or adapting pre-trained models to new domains/modalities
Familiarity with ONNX, TensorRT, or similar inference optimization tools, * Obsessed with model internals - you think in terms of structural architecture and custom implementations, rather than just training runs and applied endpoints
Able to move between research papers and production code
A strong communicator who can explain architecture tradeoffs to cross-functional teams

Why This Role

Own the model architecture strategy across all of Mecka's pipelines
Solve a critical temporal modeling challenge with immediate impact
Work at the intersection of perception, robotics, and ML systems
High ownership in a fast-moving, well-funded robotics AI company

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all