API Reference

Supported Data Sources

Supported Data Sources

Spartera connects to your existing data infrastructure without requiring data migration
or replication. Our platform supports a comprehensive range of modern data warehouses,
databases, and analytics platforms across major cloud providers.

Amazon Web Services (AWS)

Amazon Redshift

  • Type: Cloud data warehouse
  • Best for: Large-scale data analytics, business intelligence
  • Authentication: AWS Access Keys, IAM roles
  • Features: Auto-scaling, columnar storage optimization

Amazon Redshift Serverless

  • Type: Serverless cloud data warehouse
  • Best for: Variable workloads, simplified operations
  • Authentication: AWS Access Keys, IAM roles
  • Features: Automatic scaling, pay-per-use pricing, no infrastructure management

Amazon Aurora Provisioned (MySQL)

  • Type: MySQL-compatible relational database
  • Best for: High-performance transactional workloads, MySQL migrations
  • Authentication: Database credentials, IAM database authentication
  • Features: Up to 5x MySQL performance, automated backups, read replicas

Amazon Aurora Provisioned (PostgreSQL)

  • Type: PostgreSQL-compatible relational database
  • Best for: High-performance transactional workloads, PostgreSQL migrations
  • Authentication: Database credentials, IAM database authentication
  • Features: Up to 3x PostgreSQL performance, automated backups, read replicas

Amazon Aurora Serverless

  • Type: On-demand, auto-scaling relational database
  • Best for: Intermittent workloads, variable traffic applications
  • Authentication: Database credentials, IAM database authentication
  • Features: Automatic start/stop, capacity auto-scaling, pay-per-second billing

Amazon RDS MySQL

  • Type: Managed MySQL database
  • Best for: Traditional MySQL workloads, lift-and-shift migrations
  • Authentication: Database credentials, IAM database authentication
  • Features: Automated backups, multi-AZ deployments, read replicas

Google Cloud Platform (GCP)

Google BigQuery

  • Type: Serverless data warehouse
  • Best for: Real-time analytics, machine learning integration
  • Authentication: Service Account Keys, OAuth 2.0
  • Features: Automatic scaling, built-in ML capabilities, petabyte-scale analytics

Cloud SQL MySQL

  • Type: Fully managed MySQL database
  • Best for: Web applications, standard MySQL workloads
  • Authentication: Built-in database authentication, Cloud SQL Proxy
  • Features: Automated backups, high availability, automatic storage increases

Cloud SQL PostgreSQL

  • Type: Fully managed PostgreSQL database
  • Best for: Web applications, standard PostgreSQL workloads
  • Authentication: Built-in database authentication, Cloud SQL Proxy
  • Features: Automated backups, high availability, automatic storage increases

Multi-Cloud Platforms

Snowflake

  • Type: Cloud-native data platform
  • Best for: Multi-cloud data sharing, diverse workloads
  • Authentication: Username/password, key pair authentication
  • Features: Zero-copy cloning, time travel queries, cross-cloud data sharing

Teradata Vantage

  • Type: Enterprise analytics platform
  • Best for: Complex analytical workloads, mixed processing
  • Authentication: Database credentials, LDAP integration
  • Features: Advanced analytics, graph processing, multi-cloud deployment

Database Platforms

Microsoft SQL Server

  • Type: Relational database management system
  • Best for: Enterprise applications, transactional workloads
  • Authentication: SQL Server authentication, Windows authentication
  • Features: Integration Services, Reporting Services, in-memory processing

Supabase PostgreSQL

  • Type: Open-source PostgreSQL database
  • Best for: Modern applications, real-time features
  • Authentication: Connection strings, API keys
  • Features: Real-time subscriptions, built-in authentication, RESTful API

Supabase Analytics

  • Type: Analytics-focused PostgreSQL
  • Best for: Application analytics, user behavior tracking
  • Authentication: Project API keys, service roles
  • Features: Built-in analytics functions, real-time insights, event tracking

Connection Architecture

Security-First Design

All connections to your data sources are:

  • Read-only: Spartera never modifies your source data
  • Encrypted: All data transmission uses industry-standard encryption
  • Credential-secured: Your database credentials are encrypted and stored securely
  • Network-isolated: Connections respect your network security policies

Data Locality

  • Zero Data Movement: Your data never leaves your infrastructure
  • In-place Processing: Analytics run where your data lives
  • Results Only: Only computed insights and aggregated results are transmitted
  • Compliance-Ready: Meets GDPR, SOC 2, and other regulatory requirements

Connection Requirements

Network Access

  • Outbound internet connectivity from your data platform
  • Firewall rules allowing connections to Spartera's processing endpoints
  • Optional: VPN or private network connectivity for enhanced security

Permissions

  • Read access to relevant databases, schemas, and tables
  • Ability to execute queries and analytical functions
  • Optional: Access to metadata for enhanced discovery

Credentials

Depending on your platform:

  • AWS: Access Key ID and Secret Access Key (with appropriate IAM policies), or IAM database authentication
  • GCP: Service Account JSON key file, or Cloud SQL Proxy connection
  • Database: Username and password with read permissions
  • API-based: API keys or tokens with data access rights

Performance Optimization

Query Optimization

  • Leverage your platform's native query optimization
  • Use indexed columns for filtering and aggregation
  • Consider partitioning strategies for large datasets

Resource Management

  • Configure appropriate compute resources for your workload
  • Monitor query performance and resource utilization
  • Implement query result caching where appropriate

Best Practices

  • Use specific column selection rather than SELECT *
  • Implement appropriate WHERE clauses to limit data scope
  • Consider time-based partitioning for temporal data
  • Optimize JOIN operations for better performance

Our platform is designed to work with your existing data architecture,
providing powerful analytics capabilities without disrupting your current
infrastructure or requiring costly migrations.