Skip to product information
1 of 4

Novatech AI Compute Server — Dual RTX PRO 6000 Blackwell + Threadripper PRO — AI Training, Large Model Inference & GPU Rendering Server

Novatech AI Compute Server — Dual RTX PRO 6000 Blackwell + Threadripper PRO — AI Training, Large Model Inference & GPU Rendering Server

Ships Within 7 Business Days

Enterprise AI Training & Inference Server

up to Dual RTX Pro 6000 Blackwell | 192GB VRAM | Threadripper Pro

Purpose-Built for Large-Scale AI Workloads & Enterprise Workflows
This server is engineered for teams who need:
• Private, deterministic AI training and inference (no cloud variability)
• Fine-tuning multi-billion parameter models on dedicated hardware
• High concurrency inference for internal copilots and client APIs
• Secure and compliant compute environments (regulated or air-gapped)

Designed for AI startups, research labs, and enterprise ML teams that require performance, control, and predictable cost.



Unleash the Future of Professional Computing

The Novatech RM6000 is engineered for creators, AI innovators, and technical professionals who demand the absolute best in compute performance. Built in a 6U SilverStone RM600 rackmount chassis with exceptional airflow, this powerhouse features single or dual NVIDIA RTX™ PRO 6000 Blackwell Workstation GPUs—each equipped with 96GB of ultra-fast GDDR7 VRAM, delivering a massive 192GB total VRAM for unparalleled performance in AI training, simulation, 3D rendering, and large-scale data processing.


Cutting-Edge Hardware

  • CPU: AMD Threadripper PRO 7995WX – 96 cores / 192 threads, designed for extreme multitasking and stability.

  • Motherboard: ASUS Pro WS WRX90E-SAGE SE – enterprise-grade reliability with PCIe 5.0 support.

  • RAM Options: 128GB DDR5 ECC Registered (2×64GB, 5600 MHz) – rock-solid stability under heavy workloads.

  • GPU: 1x or 2x NVIDIA RTX PRO 6000 Blackwell Workstation Edition (VCNRTXPRO6000BPB) – 192GB total VRAM, 96GB per GPU.

  • Storage Options: Samsung 990 Pro Gen 4 NVMe SSDs (1TB, 2TB, or 4TB) for lightning-fast load times and massive project files.

  • Cooling: Thermalright XE360-TR5 liquid cooling for the CPU, optimized for Threadripper thermals.

  • Power Supply: FSP Cannon Pro 2000W Platinum-rated PSU – rock-solid stability for high TDP components.

  • Connectivity: Enterprise networking support, with optional 100GbE NIC compatibility.


Built for AI, Rendering & Extreme Workflows

Whether you’re training advanced AI models, rendering ultra-complex 3D environments, or running high-throughput simulations, the dual Blackwell architecture offers unmatched compute density and efficiency. With PCIe Gen 5 bandwidth and ECC memory throughout, you get maximum speed and data integrity in mission-critical environments.


Highlights

  • Total VRAM: 192GB ECC GDDR7 – massive parallel processing for AI & GPU-heavy workloads.

  • ECC DDR5 Memory: Error-correcting memory for unmatched reliability.

  • Upgradeable Storage & Networking: Scalable for future needs.

  • Rackmount-Ready: Perfect for data centers, production houses, and enterprise deployments.


Why Choose Novatech

At Novatech, we specialize in high-performance, custom-configured systems built with precision, tested for reliability, and backed by world-class customer support. Every RM6000 is stress-tested before shipping to ensure it meets the demands of your workflow from day one.


Bring next-generation power to your AI lab, VFX pipeline, or research environment with the Novatech RM6000 – Single or Dual NVIDIA RTX PRO 6000 Blackwell.

processor
Threadripper PRO 7995WX
Ram
128/256/512/1TB
Graphic card
1x or 2x NVIDIA RTX™ PRO 6000
Storage
1/2/4/8TB
Operating System
Windows 11 Pro or Linux
Power Supply
FSP Cannon Pro 2000W ATX
SAVE $1,500.00
Regular price $28,499.99
Regular price $29,999.99 Sale price $28,499.99
Sale Sold out
5% OFF
1-Year Warranty
Free Expedited Shipping*
Free Returns

Hardware Highlights

Core Compute

• AMD Threadripper PRO 7995WX – 32 cores / 64 threads — extreme multi-tasking & parallel workloads.

GPU Architecture

• Dual NVIDIA RTX PRO 6000 Blackwell — 96 GB GDDR7 each (192 GB aggregate) — massive VRAM headroom for LLMs.

Memory & Storage

• ECC Registered DDR5 — data integrity & uptime for mission-critical workflows.

• Optional 1–4 TB PCIe Gen4 NVMe — fast local datasets and model storage.

Enterprise-Grade Infrastructure

• Liquid CPU cooling + 2000W Platinum PSU — stability under long jobs.

• Rackmount chassis — 19″ rack data center ready.

What This System Enables

Engineered Workloads:

• On-Prem LLM Fine-Tuning (70B+ params) — local, secure, full-precision training

• Production Inference at Scale — high throughput APIs with low latency

• Private Copilots — govern models & data entirely in-house

• Computer Vision & Multi-Modal Training — large GPU memory for models and datasets

• Regulation & Compliance Computing — secure, air-gapped workflows

Shipping & Assembly

  • Assembly: Fully assembled, tested, and ready to use out of the box.

  • Plug & Play: Just connect peripherals and power, no setup required.

  • Shipping: Usually ships within 5-7 business days 

  • Free shipping across the USA (contiguous 48 states)

  • Packaging: Foam-protected internals, secure outer box 
View full details

AI Performance & Capability

Enterprise-aligned estimates for model fit, inference throughput, diffusion performance, and cloud ROI framing.

Capability

Model Capability Overview

Model / Class Quantization Inference Fine-tuning
Llama 3 70B FP16 / 4-bit
Mixtral 8x22B FP16 / 4-bit
13B–34B class FP16 🚀
405B (sharded) 4-bit ⚠️

Legend

= Supported / recommended
🚀 = Excellent / best performance
⚠️ = Possible, but with constraints

Actual capability varies by framework, context length, batch size, and optimization stack.

Throughput

Token Throughput Estimates

Model Precision Estimated tok/sec
7B FP16 600–900 tok/sec
13B FP16 350–600 tok/sec
34B FP16 180–300 tok/sec
70B FP16 120–200 tok/sec
70B 4-bit 220–350 tok/sec

Estimates assume an optimized inference stack (e.g., vLLM / TensorRT-LLM). Batch inference can materially increase throughput.

Generative

Image & Diffusion Performance

Workflow / Model Estimated performance
Stable Diffusion XL ~1–2 sec/image (optimized)
SD Turbo Sub-second generations (pipeline dependent)
Video diffusion (optimized) Near real-time depending on resolution

Render time depends on resolution, steps, scheduler, and pipeline optimizations.

ROI

Cloud Cost Comparison

Scenario Estimate
H100-class cloud instance $90–$110/hr (varies by region/commit)
8 hrs/day utilization ~$20k–$26k/month
Directional break-even Often ~4–6 months (utilization dependent)

Cloud rates vary by region, commitment, and availability. Use as directional procurement framing.

Private AI Workstation, Without Cloud Tradeoffs

Deploy an on-prem AI workstation designed for LLM training hardware, inference, and secure internal workflows—without giving up performance or control.

Max configuration

Reference build for LLM training & inference

Designed for teams that want predictable throughput and private data workflows—scalable from a strong base config to a fully-loaded build.
ThreadRipper PRO 7995X 512GB DDR5 RAM 2x RTX PRO 6000 8TB NVMe storage
Configurations can start smaller and scale to match your dataset size, batch/concurrency targets, and storage needs.
Use cases

Enterprise Workloads

Built for teams that need private AI infrastructure—where governance, predictable throughput, and dedicated GPU capacity matter.

AI inference workstation for internal copilots

Run private copilots, RAG, and assistants behind your firewall.
  • Predictable throughput for daily usage
  • Integrates with SSO/VPN/VPC workflows
  • Private AI infrastructure for sensitive prompts

Fine-tuning, evaluation & benchmarking

Iterate on models with controlled datasets and repeatable tests.
  • Hardware headroom for iteration speed
  • Reproducible runs for governance
  • Keep weights and datasets local

Vision, video & multimodal pipelines

GPU-heavy workloads for QA, inspection, and media analytics.
  • High VRAM headroom (GPU-dependent)
  • Batchable inference workloads
  • Cost control vs. per-minute cloud billing

Concurrent users & multiple workflows

Serve multiple teams or tools with headroom for peak usage.
  • Partition workloads across GPUs (config-dependent)
  • Higher concurrency for chat + retrieval
  • Supports larger contexts and tools
Fast path: include your model, expected users/concurrency, and whether data must remain on-prem.
Why on-prem?

Control, Security & Predictable Performance

If your workloads are steady—or your data can't leave your environment—on-prem can be the simplest path to consistent performance and privacy.
  • Private infrastructure: keep prompts, documents, and weights inside your environment.
  • Predictable performance: dedicated GPU capacity for inference and fine-tuning.
  • Cost stability: avoid hourly cloud spikes when usage is steady or growing.
  • Compliance alignment: easier governance for regulated data flows.
Important

Who This Is NOT For

We'll tell you if you're better served by cloud or a smaller system. This is the right fit when you need dedicated GPU capacity and plan to use it.
  • Occasional AI usage only (a few hours per month).
  • Need instant 50+ GPU burst scaling for short spikes.
  • Not ready to manage deployment/model ops (we can help, but it's still required).
  • Looking for a "gaming PC" experience—this is workstation-grade hardware.
Get in touch

Ready to spec your build?

Tell us your model, expected concurrency, and whether data must stay on-prem. We'll put together a configuration that fits.
Email Sales
sales@novatechsolutions.ai

3D & Video Production Performance

Directional, production-aligned expectations for 3D rendering, simulation, and professional video workflows.

Works great with

CPU: Threadripper Pro 7995WX (96C) GPU: Dual RTX Pro 6000 (Blackwell) Memory: Up to 512GB ECC Storage: Up to 10TB NVMe

3D Rendering

Rendering & Simulation

Software Workload Expected performance
Blender (Cycles) GPU path tracing / final-frame rendering 🚀 Strong dual-GPU scaling (scene/VRAM dependent); often ~2× throughput vs single GPU
Cinema4D + Redshift GPU rendering + interactive lookdev 🚀 High RTX throughput; dual-GPU scales well for final renders
OctaneRender Photoreal path tracing 🚀 Excellent multi-GPU scaling; ideal for heavy RTX workloads
Unreal Engine 5 Nanite + Lumen viewport / virtual production 🚀 Real-time viewport with complex assets; GPU + RAM help large projects
Houdini / Sim workloads CPU simulation + caching 🚀 96 cores + high RAM enables large sims and heavy caching

Multi-GPU scaling depends on renderer support and whether the scene fits in GPU VRAM. CPU simulations benefit from core count and memory bandwidth.

Legend

🚀 = Excellent / best-in-class
= Supported / recommended
⚠️ = Possible with constraints

Video Production

Editing, Color, Delivery

Software Workflow Expected capability
DaVinci Resolve 8K editing + color grading (GPU accelerated) 🚀 Smooth timeline playback in many workflows; strong GPU effects performance
Adobe Premiere Pro Multi-stream 4K editing + GPU effects ✅ Excellent real-time editing; performance depends on codecs/effects stack
After Effects Large comps + RAM-heavy projects 🚀 High RAM supports massive compositions and heavy caching
Media Encoder / Delivery H.264 / HEVC export (GPU accelerated) ✅ Fast exports with GPU acceleration (workflow dependent)

Playback/export depends on codec, effects stack, GPU acceleration settings, and storage throughput. Use these as directional expectations.

Large Project Handling

Built for Heavy Scenes & Complex Timelines

Resource Production benefit
512GB ECC RAM Massive scenes and simulations
96-core Threadripper Pro High performance physics, CPU rendering, simulation
Dual RTX Pro 6000 GPUs High ray-tracing throughput
Large VRAM capacity Handles complex textures and geometry
Up to 10TB NVMe storage Fast loading of large asset libraries

Best results come from aligning GPU VRAM capacity to your largest scenes (textures/geometry) and keeping active projects on fast NVMe storage.

Render Speed Comparison

Quick Value Comparison (Directional)

System Relative render speed
Typical RTX 5080 workstation
Single RTX 5090 or Pro 6000 workstation ~1.4–1.6×
Dual RTX 5090 or Pro 6000 workstation ~2.4–2.8×

Note: Estimates are directional and based on expected workstation-class behavior for comparable hardware. Actual results vary by scene complexity, codecs, effects, renderer settings, and software versions.

NOVATECH Apex WS9995X: Enterprise Power. AI-Driven. Future-Ready.
Experience extreme performance with the AMD Ryzen Threadripper PRO 9995WX (96 cores, 192 threads) and NVIDIA RTX PRO 6000 with 96GB VRAM. Built for AI, HPC, data science, 3D rendering, simulation, and content creation, the Apex WS9995X is the ultimate workstation for professionals who demand it all.
icon
AI & Data Science Optimized
Harness 96 cores and NVIDIA RTX PRO 6000 acceleration to power through AI model training, deep learning, and big data analytics without compromise.
icon
Scalable Power
With 512GB DDR5 ECC RAM and 10TB of NVMe Gen 5 storage, this workstation is engineered for massive datasets, predictive modeling, and enterprise workloads.
icon
Fast & Efficient
The Threadripper PRO 9995WX delivers industry-leading multithreaded performance, ensuring smooth multitasking, real-time rendering, and responsive workflows at every stage.
icon
Cool & Reliable
Advanced ASUS ProArt 420mm liquid cooling and a 1200W 80+ Gold PSU keep the system quiet, efficient, and stable under sustained heavy loads.
icon
Why Choose the Novatech AI Workstation?
Built for AI professionals, researchers, and creators who need uncompromising speed, reliability, and future-ready performance.
main-image
Is it powerful enough for machine learning?
Yes — with up to 96 cores / 192 threads and the NVIDIA RTX PRO 6000 (96GB VRAM), the Apex WS9995X is purpose-built for AI training, deep learning, and neural networks at scale.
Can it handle big datasets and multitasking?
Absolutely. With up to 512GB of DDR5 ECC memory and 10TB of Gen 5 NVMe storage, it’s designed to process massive datasets while running multiple applications simultaneously.
Is it reliable for long workloads and 24/7 operation?
Yes — advanced liquid cooling, enterprise-grade ECC memory, and 1000W+ Gold-rated PSUs ensure stability and reliability during continuous workloads and mission-critical tasks.
Will it stay relevant as technology advances?
Definitely. The ASUS Z790 motherboard supports PCIe 5.0, Wi-Fi 6E, and expansion options, making it a future-proof platform for years to come.
Is this workstation only for AI?
Not at all. It’s also perfect for 3D rendering, video editing, CAD, simulations, and high-end content creation, making it versatile for any professional workflow.
main-image
Unleash Enterprise-Grade Productivity With Novatech Apex Workstations
Transform your company’s workflows with faster AI model training, real-time rendering, and reliable performance — designed for professionals who need maximum power without compromise.
Accelerate Innovation

Cut model training times and speed up simulations with the AMD Threadripper PRO 9995WX (96 cores, 192 threads) and NVIDIA RTX PRO 6000 with 96GB VRAM.

Reliable For Business Workloads

Run AI, HPC, data analytics, 3D rendering, and engineering simulations without bottlenecks — keeping teams productive and projects on schedule.

Scale With Confidence

With 512GB DDR5 ECC memory, 10TB of Gen 5 NVMe storage, and a future-ready WRX90E platform, your workstation grows with your company’s needs.

Customize