ML Application Engineer
About The Position
We are looking for an experienced AI/ML engineer to join our growing AI pipelines team.
Company Overview
NeuReality is developing an innovative HW/SW computing platform for DL inference acceleration that sets new bars for performance and cost. Our platforms target cloud and enterprise (on-prem) environments.
Role Description
You will join the core AI/ML Pipeline development team of NeuReality R&D, which is responsible for the following main deliverables:
- Designing, analyzing, and optimizing workloads from various sources (open source, customer-provided, home-grown) on NeuReality platforms. The focus is on workloads for NLP, speech, and computer vision.
- Benchmarking and competitive analysis of workloads on other inference acceleration platforms.
- Working directly with customers on new requirements and efficient deployment of their workloads on the NeuReality platform.
- Identifying gaps and new requirements for SW/HW to improve workload performance and deployment efficiency.
This is an exciting opportunity to work on cutting-edge and emerging technologies across the multi-disciplinary domains of deep-learning models and computer architectures.
This is not a data science position!
Requirements
- BSc/MSc in Computer Science or Computer Engineering from an accredited university
- 3 years of hands-on experience in Python programming and DL frameworks (mainly PyTorch)
- Experience with hardware/embedded systems is an advantage
- Experience in implementing algorithms on embedded systems
- Experience in ML engineering and specifically, developing AI pipelines (composed of trained DL models and pre/post-processing), data streaming, model zoo handling, and inference serving in production environments.