MulticoreWare

Case Study

Enhancing AI Model Support for RISC-V

October 3, 2024

Client

The customer is a RISC-V based AI accelerator company.

Challenge

The customer’s accelerator hardware initially supported a minimal set of models through their NN software ecosystem. The project’s goal was to extend support to various other models.

Our team was tasked with creating end-to-end model inference pipelines, demos, and benchmarking for various CNN and NLP models using their architectures and custom APIs. This included rewriting torch models using functional torch APIs, converting operations to the customer’s custom NN library, and optimizing the functional models for more efficient hardware use.

Solution

Leveraging our expertise in end-to-end model inference pipelines across various customer hardware, our team of solution architects successfully added support for models such as Stable Diffusion, Llama2, RoBERTa, Swin, and various CNN, NLP, and transformer-based models on different architectures of the customer’s hardware. The correctness of the model inference pipeline was verified using PyTorch reference code and the PCC metric.

With MulticoreWare’s expertise and the rapid development of APIs and features, we swiftly adapted to new APIs and enhanced models by analyzing memory layouts and configurations despite minimal documentation. At the customer’s request, we conducted unit tests of the operations for select models across various input resolutions.

Despite challenges with limited documentation and a rapidly evolving repository, our team successfully met the customer’s requirements, demonstrating our ability to quickly learn and apply new technologies, overcoming obstacles to deliver quality results.

Technology Overview

Solution Highlights

  • Developed an end-to-end model inference pipeline for 35+ models for the customer’s hardware architecture using their NN APIs.
  • Benchmarked more than 15 models with public datasets for CNN and NLP models.
  • Conducted unit testing and reported unsupported operation variants and issues for over 15 models.

Business Impact

MulticoreWare enhanced the customer’s market competitiveness by offering a comprehensive AI ecosystem, attracting a broader customer base. The project also increased revenue opportunities through higher adoption of their AI hardware and APIs, leading to business growth for the customer.

Conclusion

MulticoreWare demonstrated proficiency in creating end-to-end model inference pipelines, rapid adaptation to evolving APIs, and effective benchmarking and unit testing. Discover how we can help you achieve innovative results. Contact our team at info@multicorewareinc.com

Share Via

Explore More

May 8 2026

Optimizing Android Application Performance for Remote GPU Rendering Platforms

Customer
The customer is a technology company specializing in GPU virtualization middleware that enables discrete processing units to be aggregated into shared resource pools and accessed remotely across conventional network infrastructure.

Read more
Apr 9 2026

Agentic AI for RAN Observability, Explainability and Orchestration

Customer A global telecommunications and network infrastructure company that provides advanced software, hardware, and services for building, managing, and optimizing large-scale telecom and enterprise networks. Its solutions leverage AI, automation, and end-to-end visibility to help operators enhance performance, ensure reliability, and efficiently manage complex, multi-domain network environments. Problem Statement Radio Access Networks (RAN) are the  … Read more

Read more
Apr 3 2026

Embedded Platform Optimization for Advanced Drone Systems: Lidar and Motor Control Integration

Client A leading drone and robotics company developing high-performance UAV platforms for autonomous operations, industrial inspection, and surveying in complex or restricted environments. Problem Statement Simultaneously executing high-throughput LiDAR processing and latency-critical motor control on resource-constrained embedded systems creates a fundamental bottleneck in real-time performance and scalable UAV autonomy. Challenge 1: High-Speed Sensor Integration Integrating  … Read more

Read more

GET IN TOUCH

    Please note: Personal emails like Gmail, Hotmail, etc. are not accepted
    (Max 2000 characters)