Data Center
Managing Increasingly Complex AI Workloads
Cloud-based AI inference in data centers is a critical part of the AI ecosystem, enabling real-time decision-making and driving innovation across various industries, including retail, e-commerce, healthcare, industry 4.0, gaming, and many others.
Powering Flexible, Efficient AI Inference
Data centers serve as the backbone for many widespread AI applications, such as chatbots, coding assistants, predictive maintenance, intelligent analysis tools, and content moderation. This diverse usage requires AI inference solutions to support a wide variety of networks while maximizing efficiency in both power consumption and performance. These solutions must have the flexibility to accommodate today's popular networks, as well as the capability to support newer and larger networks in the future.
High Performance, High Scalability for Today and Tomorrow's Needs
Origin Evolution™ for Data Center offers out-of-the-box compatibility with popular LLM and CNN networks. Attention-based processing optimization and advanced memory management ensure optimal AI performance across a variety of today’s standard and emerging neural networks. Featuring a hardware and software co-designed architecture, Origin Evolution for Data Center scales to 128 TFLOPS in a single core, with multi-core performance to PetaFLOPs.
"Reducing Memory Bandwidth
Origin Evolution's packet-architecture reduces memory requirements of popular LLMs like Llama 3.2 and Qwen1 by as much as 79%, saving system power and offering a much better utilized processor."
"Full Software Stack
Origin Evolution employs an easy-to-use software stack that allows the importing of trained networks from popular representations such as Hugging Face, Llama.cpp, PyTorch, TVM, ONNX, TensorFlow, and others, while providing various quantization options, automatic completion, compilation, estimator and profiling tools. It also supports multi-job APIs."
Origin Evolution offers out-of-the-box support for 100+ popular neural networks, including Llama2, Llama3, ChatGLM, DeepSeek, Mistral, Qwen, MiniCPM, Yolo, MobileNet, and many others.