Flexible Infrastructure Pricing

Shared or dedicated GPU resources for critical energy systems

90%+ GPU UTILIZATION

Continuous processing model maximizes efficiency and reduces costs by 60%

Shared Basic

Cost-Effective Access

$ 200 /month

Shared GPU with continuous processing and temporal isolation

Infrastructure

  • Shared GPU access via rotation
  • Temporal isolation for compliance
  • Continuous processing when active
  • Fair-share scheduling

Capabilities

  • OpenAI-compatible REST API
  • VPN-only secure access
  • Complete data isolation
  • Usage-based rotation

Support

  • Email support
  • Documentation access
  • Community forums
  • Monthly office hours
Get Started

No setup fee • Cancel anytime

Dedicated

Exclusive GPU

$ 2,000 /month

Your own dedicated GPU instance with exclusive 24/7 access

Infrastructure

  • Dedicated p5.2xlarge GPU
  • 120B MoE model
  • No sharing or waiting
  • Complete isolation

Features

  • Unlimited processing
  • 24/7 availability
  • 99.9% uptime SLA
  • Custom configurations

Support

  • 24/7 phone support
  • Dedicated success manager
  • Priority response
  • Custom integration help
Deploy Dedicated

Setup: $500 • Deploy in 15 minutes

High Availability

Mission Critical

$10,000+

Multi-AZ redundancy for critical grid operations

Infrastructure

  • Multi-AZ deployment
  • Redundant GPU instances
  • Auto-failover capability
  • Cross-region backup
  • 99.99% SLA target

Enterprise Features

  • Custom SLA terms
  • Priority support queue
  • On-site engineering team
  • Regulatory liaison services
  • Board-level reporting
Contact Sales

Volume discounts available

Cost-Effective GPU Sharing

How continuous processing delivers 60% margins

Component Shared Basic Shared Premium Dedicated
GPU Infrastructure Shared p5.2xlarge Shared p5.2xlarge (priority) Dedicated p5.2xlarge
Model Size 120B MoE 120B MoE 120B MoE
Actual GPU Cost ~$220/mo (1/10 share) ~$660/mo (3/10 share) $2,203/mo (full)
Network & VPN $20/mo $50/mo $150/mo
Storage & Backup $10/mo $25/mo $50/mo
Support & Ops $50/mo $200/mo $400/mo
Margin -$100/mo (60% margin) $265/mo (60% margin) -$803/mo (break-even)
Total $200/mo $1,200/mo $2,000/mo

Why shared infrastructure works: Through continuous processing and temporal isolation, we achieve 90%+ GPU utilization. Multiple customers share the same GPU through secure time-slicing, maintaining complete NERC CIP compliance while reducing costs by 60%.

Feature Comparison

Feature Shared Basic Shared Premium Dedicated High Availability
Infrastructure
Model Access 120B (shared) 120B (dedicated) 120B (redundant)
Deployment AWS Cloud AWS Cloud Multi-AZ
Availability Continuous when active 24/7 exclusive 24/7 redundant
SLA Best effort 99.9% 99.99%
Performance
Requests/Month Based on rotation Unlimited Unlimited
Rate Limit When active None None
Latency 50-200ms 50-200ms <50ms
Context Window 32K tokens 32K tokens 32K tokens
Security & Compliance
VPN Access
Encryption
Air-Gap Option
Audit Logs 30 days Unlimited Custom
Support
Support Level Email 24/7 Phone Dedicated team
Response Time 24 hours 1 hour 15 minutes
Success Manager
Custom Training 4 sessions/year Unlimited

Frequently Asked Questions

How does continuous processing work?

Instead of fixed time slots, our scheduler rotates GPU access based on actual usage. When you have work queued, you get continuous processing. This achieves 90%+ GPU utilization versus 10-20% with traditional slot-based systems.

What's the difference between 20B and 120B models?

The 20B model handles most tasks excellently - code analysis, documentation, Q&A. The 120B model excels at complex reasoning, extensive context understanding, and nuanced technical analysis.

Can I switch between plans?

Yes, you can upgrade or downgrade anytime. Cloud plans change immediately, on-premise requires hardware changes.

Do you store or train on our data?

Never. Your prompts and responses are processed in real-time and never stored. We don't use customer data for training or any other purpose.

How do you maintain NERC CIP compliance with sharing?

Temporal isolation ensures only one customer can access the GPU at any moment. Memory is completely flushed between sessions, and all access is logged for audit. This meets NERC CIP requirements for logical separation.

Can I try before committing?

Yes! We offer a 30-day proof of concept for qualified organizations. Contact sales to discuss your requirements.

What's the difference between shared and dedicated?

Shared infrastructure rotates GPU access between customers, achieving 60% cost savings. Dedicated gives you exclusive 24/7 access to your own GPU instance with no waiting or sharing.

How fast is deployment?

Cloud deployment takes 15 minutes. On-premise appliance ships in 2-3 weeks with remote setup assistance.

ROI Calculator

Compare GridTelligence to building your own LLM infrastructure

Build Your Own

  • P5.2xlarge GPU instance$2,203/mo
  • DevOps engineer (0.5 FTE)$8,000/mo
  • Security & compliance$3,000/mo
  • Monitoring & support$2,000/mo
  • Total Monthly Cost$15,203/mo
Save $158,436 per year

Ready to Secure Your Grid with AI?

Deploy in 15 minutes with our shared infrastructure

Get Started

No credit card required • Deploy in 15 minutes • Cancel anytime