• Skip to main content
  • Skip to footer

Expedera

  • Products
    • Products Overview
    • Evolution for Edge
    • Evolution for Mobile
    • Evolution for Automotive
    • Evolution for Data Center
    • Origin E1
    • Origin E2
    • Origin E6
    • Origin E8
    • TimbreAI T3
  • Applications
    • Automotive
    • Data Center
    • Industrial
    • Mobile
    • Virtual Reality
  • News
    • News
    • In the News
    • Blog
    • Events
    • White Papers
  • About Us
    • Company
    • Leadership
    • Careers
    • Contact Us
  • Search
  • English
    • 简体中文

Search

Origin Evolution for Edge

Edge-friendly LLM and CNN AI Inference processing

Edge devices are increasingly equipped with advanced AI processing capabilities that enhance functionality and improve the user experience. While many of these devices previously depended on cloud-based inference, manufacturers are now shifting towards on-device inference. This transition helps to lower latency, reduce overall power consumption, and minimize the need for cloud processing, thereby cutting costs.

DOWNLOAD PRODUCT BRIEF

Perfect-Fit Solutions

Origin Evolution™ for Edge offers out-of-the-box compatibility with today's most popular LLM and CNN networks. Attention-based processing optimization and advanced memory management ensure optimal AI performance across a variety of networks and representations. Featuring a hardware and software co-designed architecture, Origin Evolution for Edge scales to 32 TFLOPS in a single core to address the most advanced edge inference needs.

Bringing AI to the Edge

Edge device makers are adding more AI to their products, including advanced LLM and CNN capabilities, as they enable a new set of applications including speech and visual contextual awareness, natural language queries, predictive maintenance, and human interaction assistance. For the best user experience, the privacy and latency advantages of edge processing are clear, with edge devices moving inference processing to the device. However, as today's leading LLMs may be 20 to 50X larger than more traditional AI networks employed on past generation devices, there are significant memory and processor hurdles to that device makers must overcome before this can be realized.

Innovative Architecture

Origin Evolution uses Expedera’s unique packet-based architecture to achieve unprecedented NPU efficiency. Packets, which are contiguous fragments of neural networks, are an ideal way to overcome the hurdle of large memory movements and differing network layer sizes, which are exacerbated by LLMs. Packets are routed through discrete processing blocks, including Feed Forward, Attention, and Vector, which accommodate the varying operations, data types, and precisions required when running different LLM and CNN networks. Origin Evolution includes a high-speed external memory streaming interface that is compatible with the latest memory standards.

Customizable
Highly Memory Efficient
Sustainable Performance
Easy to Deploy
LLM, CNN, and other Network Support
Choose the Features You Need
Customization brings many advantages, including increased performance, lower latency, reduced power consumption, and eliminating dark silicon waste. Expedera works with edge device customers to understand their use case(s), PPA goals, and deployment needs during their design stage. Using this information, we configure Origin Evolution to create a customized solution that perfectly fits the application.
Reducing Memory Bandwidth
Origin Evolution's packet-architecture reduces memory requirements of popular LLMs like Llama 3.2 and Qwen1 by as much as 79%, saving system power and offering a much better utilized processor.
Efficient Resource Utilization
Origin Evolution for Edge scales to 32 TFLOPS in a single core, eliminating the memory sharing, security, and area penalty issues faced by lower-performing, tiled AI accelerator engines. Origin Evolution NPUs achieve sustained utilization averaging 80%, compared to the 20-40% industry norm, avoiding dark silicon waste.
Full Software Stack
Origin Evolution employs an easy-to-use software stack that allows the importing of trained networks from popular representations such as Hugging Face, Llama.cpp, PyTorch, TVM, ONNX, TensorFlow, and others, while providing various quantization options, automatic completion, compilation, estimator and profiling tools. It also supports multi-job APIs.

Origin Evolution offers out-of-the-box support for 100+ popular neural networks, including Llama2, Llama3, ChatGLM, DeepSeek, Mistral, Qwen, MiniCPM, Yolo, MobileNet, and many others.

Unique Packet Architecture

Ultra-Efficient Neural Network Processing

Accepting standard, custom, and black box networks in a variety of AI representations, Origin Evolution offers a wealth of user features such as mixed precision quantization. Expedera’s unique packet-based processing reduces much larger networks into smaller, contiguous fragments, overcoming the hurdle of large memory movements and offering much higher processor utilization. Packets are routed through discrete processing blocks, including Feed Forward, Attention, and Vector, which accommodate the varying operations, data types, and precisions required when running different types of networks. Internal memory handles intermediate needs, while the memory streaming interface interfaces with off-chip storage.

Features
Specifications
  • 32 TFLOPS performance
  • Support for standard, custom, and proprietary neural networks
  • Readily customized to specific use cases and deployment needs
  • Full software stack provided, including compiler, estimator, scheduler, and quantizer
  • Runs LLM, CNN, RNN, DNN, LSTM, and other network types
  • Delivered as Soft IP (RTL) or GDS
Compute Capacityup to 16K FP16 MACs
Multi-taskingRun Simultaneous Jobs
Example Networks SupportedLlama2, Llama3, ChatGLM, DeepSeek, Mistral, Qwen, MiniCPM, Yolo, MobileNet, and many others, including proprietary/black box networks
Example Performance80 tokens per second, Llama 3.1 1B (INT4 weights, INT16 Act), 1 TOPS engine, 2MB internal memory, 64GB external peak bandwidth. Specified in TSMC 7nm, 1 GHz system clock, no sparsity/compression/pruning applied (though supported)
Layer SupportStandard NN functions, including Transformers, Conv, Deconv, FC, Activations, Reshape, Concat, Elementwise, Pooling, Softmax, others. Support for custom operators.
Data typesFP16/FP32/INT4/INT8/INT10/INT12/INT16 Activations/Weights
QuantizationSoftware toolchain supports Expedera, customer-supplied, or third-party quantization. Mixed precision supported.
LatencyDeterministic performance guarantees, no back pressure
FrameworksHugging Face, Llama.cpp, PyTorch, TVM, ONNX. Tensor Flow and others supported

Download our White Papers

White Papers

Get in Touch With Us

Contact

STAY INFORMED

Subscribe to our News

This field is for validation purposes and should be left unchanged.

Footer logo
  • Products
    • Products Overview
    • Origin E1
    • Origin E2
    • Origin E6
    • Origin E8
    • TimbreAI T3
  • Applications
    • Mobile
    • Entertainment
    • Virtual Reality
    • Smart Home
    • Automotive
    • Industrial
  • Latest News
    • News
    • Blog
    • Events
    • White Papers
  • About Us
    • Careers
    • Contact Us
  • Privacy Policy
  • Web Accessibility

Follow us

dashicons-facebook dashicons-linkedin dashicons-youtube

This site is protected by the Google Privacy Policy and Terms of Service apply

Copyright © 2025 Expedera. All Rights Reserved.

This website uses cookies to improve your experience. View our privacy policy Accept
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
CookieDurationDescription
cookielawinfo-checbox-analytics11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional11 monthsThe cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-necessary11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance11 monthsThis cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy11 monthsThe cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Analytics
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Advertisement
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Others
Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.
SAVE & ACCEPT