Enterprise-grade hybrid AI platform 企業級混合式 AI 平台

AI Data Platform AI Data Platform

Phison AI Data Platform is Phison Electronics' complete one-stop AI infrastructure and application private cloud platform for external deployment. It covers underlying hardware, heterogeneous compute management, AI middleware, AI application modules, and SaaS services. 群聯 AI Data Platform 是群聯電子面向外部部署的完整一站式 AI 基礎設施與應用私有雲平台,涵蓋底層硬體、異質運算管理、AI 中介層、AI 應用模組與 SaaS 服務。

Faster更快Deployment部署

Stand up AI environments and model services in days, not months.快速部署 AI 環境與模型服務

Higher更高GPU UtilizationGPU 效益

Maximize GPU efficiency and overall resource utilization.提升 GPU 使用率與資源效率

Lower更低AI Infrastructure CostAI 基礎設施成本

Cut AI build-out and day-to-day operating expenses.降低 AI 建置與運營成本

Scalable可擴展AI ArchitectureAI 架構

Built for multi-node scale and future AI growth.支援多節點與未來 AI 擴充需求

Enterprise AI Today 企業 AI 的現實

The most common obstacles enterprises face on their path to private AI. 企業在私有 AI 路上最常遇到的障礙。

Inference & GPU utilization 推論與 GPU 利用率

ML Engineer at Scale ML Engineer at Scale

"Our GPUs sit at 35% utilization between jobs, but we still can't serve more users.任務空檔 GPU 只有 35% 利用率,但還是無法服務更多使用者。"

KV Cache exhaustion causes OOM and hard concurrent request caps. Multiple inference instances can't share a KV Cache pool, leading to redundant compute waste. Storage I/O becomes a bottleneck during checkpoint loading, and it's hard to dynamically allocate GPU resources across models. KV Cache 用完就 OOM,concurrent request 上限卡死。多個推論實例無法共享 KV Cache pool,造成重複計算浪費。存儲 I/O 在 checkpoint loading 時成為瓶頸,也難以動態分配 GPU 資源給不同模型。

Infrastructure operations 基礎設施營運

IT / Data Center Director IT / 資料中心主管

"Managing separate GPU racks, storage arrays, and switches is eating my team alive.GPU 機櫃、儲存陣列、交換器各自管理,快把團隊耗盡了。"

Three independent systems — compute, storage, and network — each need their own ops team. AI workload power density breaks traditional data center planning, while 6–9 month procurement cycles can't keep up with business expansion. Right-sizing is equally painful: over-provision wastes budget, under-provision breaks SLA. 運算、存儲、網路三套獨立系統各自維護,人力成本高。AI workload 的 power density 讓傳統機房規劃失效,6–9 個月的採購周期也跟不上業務擴張。Right-size 同樣困難:over-provision 浪費預算,under-provision 則影響 SLA。

Private LLM strategy 私有 LLM 策略

CTO / Technology Director CTO / 技術長

"We want private LLMs, but the complexity and upfront cost are paralyzing us.我們想要私有 LLM,但複雜度與前期成本讓我們動彈不得。"

Public cloud AI API costs exceed budget at production scale, and sensitive data can't leave the country — so OpenAI and Azure aren't viable. With no in-house GPU infra engineers, outsourcing risks vendor lock-in, and it's unclear which open-source LLM fits the business. 公有雲 AI API 費用在規模化後超出預算,敏感資料不能離境,無法直接用 OpenAI/Azure。內部缺乏 GPU infra 工程師,外包又怕 vendor lock-in,也不確定哪個開源 LLM 適合自身業務。

Compliance & sovereignty 合規與資料主權

Financial Services / Healthcare 金融服務 / 醫療

"Our compliance team vetoed cloud AI. We're stuck running nothing.合規團隊否決了雲端 AI,我們只能原地踏步。"

Regulations require inference on-premise under GDPR and healthcare privacy laws. Every AI inference needs a complete audit log that public cloud can't provide, and training data with sensitive customer info can't go to external services. Passing ISO 27001 / SOC 2 audits makes external AI services especially hard to approve. 法規要求推論運算必須在本地完成(GDPR、醫療個資法)。每一次 AI 推論需要完整 audit log,公有雲無法提供;模型訓練資料含客戶敏感資訊,也無法上傳外部服務。要通過 ISO 27001 / SOC 2 審查,外部 AI 服務更難獲准。

Shared GPU clusters 共用 GPU 叢集

Research Institution / University 研究機構 / 大學

"Our researchers wait days in GPU queues. Science doesn't wait.研究員在 GPU 排隊要等好幾天,科學不等人。"

Shared GPU cluster queues stretch experiment cycles three to five times. TB-scale training datasets hit storage throughput limits, and teams on PyTorch, JAX, and TensorFlow compete for the same resources. Grant-driven procurement and infrastructure lifecycle rhythms are completely out of sync. 共用 GPU cluster 排隊等候,實驗周期拖長 3–5 倍。TB 級訓練資料集受存取速度限制,PyTorch、JAX、TensorFlow 多團隊也互相搶資源。計畫型採購與 infrastructure lifecycle 的節奏完全脫節。

Inference & GPU utilization 推論與 GPU 利用率

ML Engineer at Scale ML Engineer at Scale

"Our GPUs sit at 35% utilization between jobs, but we still can't serve more users.任務空檔 GPU 只有 35% 利用率,但還是無法服務更多使用者。"

KV Cache exhaustion causes OOM and hard concurrent request caps. Multiple inference instances can't share a KV Cache pool, leading to redundant compute waste. Storage I/O becomes a bottleneck during checkpoint loading, and it's hard to dynamically allocate GPU resources across models. KV Cache 用完就 OOM,concurrent request 上限卡死。多個推論實例無法共享 KV Cache pool,造成重複計算浪費。存儲 I/O 在 checkpoint loading 時成為瓶頸,也難以動態分配 GPU 資源給不同模型。

Infrastructure operations 基礎設施營運

IT / Data Center Director IT / 資料中心主管

"Managing separate GPU racks, storage arrays, and switches is eating my team alive.GPU 機櫃、儲存陣列、交換器各自管理,快把團隊耗盡了。"

Three independent systems — compute, storage, and network — each need their own ops team. AI workload power density breaks traditional data center planning, while 6–9 month procurement cycles can't keep up with business expansion. Right-sizing is equally painful: over-provision wastes budget, under-provision breaks SLA. 運算、存儲、網路三套獨立系統各自維護,人力成本高。AI workload 的 power density 讓傳統機房規劃失效,6–9 個月的採購周期也跟不上業務擴張。Right-size 同樣困難:over-provision 浪費預算,under-provision 則影響 SLA。

Private LLM strategy 私有 LLM 策略

CTO / Technology Director CTO / 技術長

"We want private LLMs, but the complexity and upfront cost are paralyzing us.我們想要私有 LLM,但複雜度與前期成本讓我們動彈不得。"

Public cloud AI API costs exceed budget at production scale, and sensitive data can't leave the country — so OpenAI and Azure aren't viable. With no in-house GPU infra engineers, outsourcing risks vendor lock-in, and it's unclear which open-source LLM fits the business. 公有雲 AI API 費用在規模化後超出預算,敏感資料不能離境,無法直接用 OpenAI/Azure。內部缺乏 GPU infra 工程師,外包又怕 vendor lock-in,也不確定哪個開源 LLM 適合自身業務。

Compliance & sovereignty 合規與資料主權

Financial Services / Healthcare 金融服務 / 醫療

"Our compliance team vetoed cloud AI. We're stuck running nothing.合規團隊否決了雲端 AI,我們只能原地踏步。"

Regulations require inference on-premise under GDPR and healthcare privacy laws. Every AI inference needs a complete audit log that public cloud can't provide, and training data with sensitive customer info can't go to external services. Passing ISO 27001 / SOC 2 audits makes external AI services especially hard to approve. 法規要求推論運算必須在本地完成(GDPR、醫療個資法)。每一次 AI 推論需要完整 audit log,公有雲無法提供;模型訓練資料含客戶敏感資訊,也無法上傳外部服務。要通過 ISO 27001 / SOC 2 審查,外部 AI 服務更難獲准。

Shared GPU clusters 共用 GPU 叢集

Research Institution / University 研究機構 / 大學

"Our researchers wait days in GPU queues. Science doesn't wait.研究員在 GPU 排隊要等好幾天,科學不等人。"

Shared GPU cluster queues stretch experiment cycles three to five times. TB-scale training datasets hit storage throughput limits, and teams on PyTorch, JAX, and TensorFlow compete for the same resources. Grant-driven procurement and infrastructure lifecycle rhythms are completely out of sync. 共用 GPU cluster 排隊等候,實驗周期拖長 3–5 倍。TB 級訓練資料集受存取速度限制,PyTorch、JAX、TensorFlow 多團隊也互相搶資源。計畫型採購與 infrastructure lifecycle 的節奏完全脫節。

THE PLATFORM 平台能力

What Solution Does Phison Provide? 群聯提供哪些解決方案?

One-Stop AI Solution 一站式 AI 解決方案

Hardware, platform, AI modules, and application services are integrated through a single platform, preventing enterprises from having to integrate multiple vendors and complex technology stacks on their own. 硬體、平台、AI 模組與應用服務透過單一平台整合,避免企業自行串接多家供應商與複雜技術堆疊。

Unified GPU Management GPU 算力統一管理

Enterprise AI environments often face fragmented GPU resources and difficulties integrating and managing heterogeneous devices. Phison AI Data Platform can centrally manage GPUs/XPUs across different brands and generations, improving overall resource utilization and scalability. 企業 AI 環境常面臨 GPU 資源分散、異質設備整合與管理困難。Phison AI Data Platform 可集中管理不同品牌與世代的 GPU/XPU,提升整體資源利用率與可擴充性。

Lower Deployment Costs 降低部署成本

Through aiDAPTIV Middleware, reusable components, and pre-integrated AI modules, the platform lowers the barrier to enterprise AI adoption and reduces infrastructure costs, minimizing large upfront PoC and integration investments. 透過 aiDAPTIV Middleware、可重用元件與預整合 AI 模組,平台降低企業 AI 導入門檻與基礎設施成本,減少大型前期 PoC 與整合投資。

ARCHITECTURE 平台架構

AI Data Platform Architecture AI Data Platform 架構

Enterprises can flexibly select the most suitable hardware devices, AI modules, and application services based on their AI application needs, existing IT environment, and budget scale. Whether they require GPU/CPU servers, cache/storage architecture, AI vision, speech, or inference modules, each component can be freely combined and expanded. This helps enterprises quickly build a scalable, manageable, and commercially viable on-premises AI infrastructure platform with sustainable growth potential and optimized cost efficiency. 企業可依據自身 AI 應用需求、既有 IT 環境與預算規模,彈性選擇最適合的硬體設備、AI 模組與應用服務。無論是 GPU / CPU Server、Cache / Storage 架構,或 AI 視覺、語音、推理等模組,皆可自由組合與擴充,協助企業以最合適的成本快速打造可持續成長的可擴充、可管理、可商業化的地端 AI 基礎平台。

Ecosystem and Application Services 生態系與應用服務

API / AI Gateway API / AI 閘道
L4

Reusable AI Modules Layer 可複用 AI 模組層

  • AI Vision Module (VLM)
  • AI Speech Module (ASR)
  • AI Inference Module (LLM)
  • Data Integration
  • MLOps / LLMOps
L3

Reusable Open Source Components Layer 可複用開源元件層

  • AI Framework
  • Messaging Queue
  • Database
  • Networking
  • Monitoring
  • aiDAPTIV Middleware

Flexible Deployment from Edge to Data Center 從邊緣到資料中心彈性部署

One layered platform deploys consistently from edge sites to central data centers. Scale the four layers below—compute fabric, unified control, hardened open components, and reusable AI modules—in step without redesigning the stack. 同一平台可從邊緣站點延伸至中央資料中心一致部署。依序擴充下方四層的運算骨幹、統一管控、開源元件與 AI 模組,無需重新設計整體架構。

L1 Infrastructure Layer 基礎設施層

GPU, CPU, cache, and storage unified by Hyper-Connections into the platform compute and data fabric. 透過 Hyper-Connections 整合 GPU、CPU、快取與儲存,構成運算與資料骨幹。

L2 Resource Management Layer 資源管理層

Hyper-converged control plane for heterogeneous GPU, XPU, storage, and VMs—pool mixed brands and generations in one place. 超融合控制面統管異質 GPU/XPU、儲存與虛擬機,混品牌與世代設備集中池化。

L3 Reusable Open Source Components Layer 可複用開源元件層

Curated, hardened open-source components—frameworks, messaging, databases, and tooling—for composable AI workflows. 精選並強化適用 AI 的開源元件(框架、佇列、資料庫與工具),作為可組合的工作流基礎。

L4 Reusable AI Modules Layer 可複用 AI 模組層

Production-ready modules for vision, speech, inference, and model lifecycle operations—from training and optimization through deployment. 可投產的視覺、語音、推理與模型生命週期模組,涵蓋訓練、優化至部署與維運。

Ecosystem and Application Services 生態系與應用服務

Phison AI Data Platform can support various AI software platforms and SaaS services. It also includes customized AI applications and industry solutions jointly developed by Phison and its customers. 群聯 AI Data Platform 支援多種 AI 軟體平台與 SaaS 服務,並包含群聯與客戶共同開發的客製化 AI 應用與行業解決方案。

aiDAPTIV Middleware aiDAPTIV 中介層

Tiered AI memory · Less VRAM · Faster inference 分層 AI 記憶體 · 省 VRAM · 推論更快

Orchestrates tiered GPU, DRAM, and SSD cache—cutting VRAM, speeding inference, and lowering training cost. 統籌 GPU、DRAM 與 SSD 分層快取,降低 VRAM、加速推論、節省訓練成本。

  • Disaggregated AI Infrastructure 解耦式 AI 基礎設施

    A disaggregated architecture improves AI system scalability and resource utilization. 解耦式架構提升 AI 系統的可擴充性與資源利用率。

  • One-Stop AI Deployment Platform 一站式 AI 部署平台

    From hardware, platform, aiDAPTIV Middleware, and AI modules to SaaS applications, the platform provides complete one-stop AI solutions that significantly shorten AI adoption cycles. 從硬體、平台、aiDAPTIV Middleware、AI 模組到 SaaS 應用,提供完整一站式 AI 解決方案,大幅縮短 AI 導入週期。

  • AI Software Ecosystem Integration AI 軟體生態系整合

    By integrating ISVs, SIs, open-source components, and AI ecosystem partners, the platform accelerates enterprise AI commercialization and implementation. 整合 ISV、SI、開源元件與 AI 生態系夥伴,加速企業 AI 商業化與落地實踐。

GPU
CPU
HBM / GDDR
DRAM
aiDAPTIVCache
KV Cache
SSD

Built for AI-Driven Organizations 為 AI 驅動的組織而生

From regulated enterprises to fast-moving startups — one platform, every scale. 從受監管的大型企業到快速成長的新創 — 一個平台,適合所有規模。

Enterprise Customers 大型企業客戶

Mission-critical scale 任務關鍵等級規模

Financial services 金融服務

High-tech manufacturing 高科技製造

Healthcare industry 醫療產業

Government agencies 政府機關

Telecommunications 電信產業

Large data centers 大型資料中心

Small & Medium 中小型組織

Built for speed 為速度而生

AI startups AI 新創

SIs / ISVs 系統整合商 / 軟體商

AI adoption by SMEs 中小企業 AI 導入

AI Agent / GenAI app development AI 代理 / GenAI 應用開發

Common Application Scenarios 常見應用場景

Enterprise Private AI 企業私有 AI

Generative AI Platforms 生成式 AI 平台

AI Intelligent Agents AI 智能代理

AI Search / RAG AI 搜尋 / RAG

AI Meeting Assistants AI 會議助理

AI Vision Recognition AI 視覺辨識

AI Inference Computing AI 推論運算

AI Edge Computing AI 邊緣運算

BENEFITS & OUTCOMES 效益與成果

Production-Proven Performance 量產實證的效能

Real performance gains from production-deployed systems. 來自正式部署系統的真實效能提升。

+ Higher ROI 更高投資報酬

aiDAPTIV reduces dependency on high-end GPUs and VRAM waste — lowering infrastructure costs while keeping throughput. Less GPU, same performance, stronger AI infrastructure ROI. aiDAPTIV 降低對高階 GPU 與 VRAM 浪費的依賴,在維持吞吐的前提下削減基礎設施成本。更少 GPU、相同效能、更高的 AI 投資報酬。

1 × Concurrent Users 並發使用者容量

Same GPU cluster, more than 2× concurrency. Fine-grained scheduling maximizes throughput without adding GPUs. 相同 GPU 叢集,並發容量提升超過 2 倍。細粒度排程在不增加 GPU 的前提下最大化吞吐量。

5 5 ×↑ ×↑ Faster TTFT 更短首字延遲

Hit-and-return replaces recompute — 5×+ faster. Shared KV Cache across the cluster drops Time-to-First-Token immediately. 命中即返取代重算,快 5 倍以上。叢集共享 KV Cache,立即降低首字延遲。

Faster Deployment 快速部署

Compress traditional AI platform adoption from months to days — pre-integrated modules shorten PoC to production. 將傳統 AI 平台導入從數月壓縮至數天,預整合模組縮短 PoC 至正式部署的距離。