Terraflock·Entropis·Xentrop
← Back to Entropis

Entropis Benchmark Suite

Official Methodology Documentation

Version 2.2 | January 2026

Scientific Standard: All metrics are independently measurable, reproducible, and falsifiable. Pass/fail criteria are defined prior to measurement.

Why New Benchmarks?

Existing AI benchmarks (MMLU, HumanEval, MLPerf) measure trained systems on static tasks. They cannot measure emergent intelligence, self-organization, or embodiment because current AI systems do not exhibit these properties.

The Entropis Benchmark Suite measures emergent properties, self-organization, and embodiment dependencies.

What This Document Provides

  • ✓ Measurement methodology
  • ✓ Pass/fail criteria
  • ✓ Scientific basis
  • ✓ Published results

What This Document Does NOT Provide

  • ◇ Implementation details
  • ◇ Architecture specifications
  • ◇ Source code
  • ◇ Proprietary algorithms

Terminology

Standard neuroscience terms used throughout this benchmark suite:

Branching Ratio (BR)

Ratio of downstream to upstream neural activity. BR ≈ 1.0 indicates the critical regime associated with optimal information processing in biological brains.

Coefficient of Variation (CV)

Standard deviation divided by mean, expressed as percentage. Measures response variability. Biological neurons show CV of 20-60%.

Criticality

The dynamical regime (BR 0.7–1.3) where neural systems achieve maximum computational capacity. First characterized by Beggs & Plenz (2003).

Interoception

Internal body signals — heartbeat, breathing, metabolic state — that biological brains continuously process. Essential for maintaining neural activity.

Hardware Invariance

Identical emergent behavior on fundamentally different hardware architectures. Proves results are mathematical truth, not implementation artifact.

Structural Plasticity

Formation and removal of synaptic connections based on activity. Distinct from weight changes — actual structural rewiring.

Critical Regime

Entropis operates at criticality (BR ≈ 1.0) — the boundary between order and chaos where biological brains achieve optimal information processing. This regime is characterized by scale-free dynamics and maximum computational capacity.

Comparison to Industry Benchmarks

Industry benchmarks test trained models on static tasks. ENT benchmarks test untrained systems on emergent capabilities.

Industry StandardWhat It TestsEntropis Equivalent
MLPerfTraining/inference speedENT-SPEED
MMLULanguage model knowledgeENT-IQ5 (emergence)
HumanEvalCode generationENT-EM5 (embodiment)
Mismatch Negativity (EEG)Novelty detection in brainsENT-NOVELTY
BCM Theory (Neuroscience)MetaplasticityENT-ADAPTIVE (L4)
ENT-SPEED

Speed Benchmark

PURPOSE

Measures neural processing throughput relative to biological brain speed.

METRIC

Hz/neuron = Total spikes processed / (Neurons × Time in seconds)

BASELINE

Human brain average firing rate: ~10 Hz per neuron

PASS CRITERIA

Hz/neuron > 10 (exceeds biological brain speed)

RESULTS

PlatformNeuronsHz/neuronMeasurementResult
NVIDIA RTX 30701,999,824,000>1000× biologicalPer-neuron firing ratePASS
NVIDIA RTX 3070470,160,000>10× biologicalFull brain averagePASS
Apple M44,999,997>1× biologicalFull brain averagePASS

Measurement Methodology

Population Average

Hz = Total spikes / (ALL neurons × time). Conservative method matching neuroscience literature.

Per-Neuron Firing Rate

Hz = Total spikes / (firing neurons × time). Standard neuroscience method for measuring individual neuron activity.

Both methods are standard in computational neuroscience. All scales exceed biological baseline processing rates.

ENT-IQ5

Intelligence Quotient (5 Markers)

PURPOSE

Validates emergent intelligence through 5 biological markers. These markers distinguish brain-like systems from calculators and are grounded in neuroscience literature.

IQ5-VAR

Adaptive Variability

What it measures: Same input produces different outputs based on internal neural state.

Metric: Coefficient of Variation (CV) = standard deviation / mean × 100%

Pass: CV > 1% (not deterministic)

Fail: CV < 1% (calculator-like, deterministic)

Result: CV significantly above threshold (PASS)

IQ5-CRIT

Critical Dynamics

What it measures: Self-organization to branching ratio ≈ 1.0 (edge of chaos).

Metric: Branching Ratio (BR) = propagated spikes / input spikes

Scientific basis: Beggs & Plenz (2003), biological brains operate at criticality.

Pass: BR enters range 0.7-1.3 without explicit targeting

Fail: BR stuck at single value OR never enters critical range

Result: BR converges to critical range (PASS)

IQ5-CASC

Cascade Distribution

What it measures: Power-law distribution of activity cascades (avalanches).

Metric: CV of cascade sizes (high CV indicates scale-free dynamics)

Scientific basis: Neural avalanches follow power-law distributions in biological brains.

Pass: CV > 100% (scale-free cascades)

Fail: CV < 50% (uniform activity, no cascades)

Result: CV significantly exceeds threshold (PASS)

IQ5-ADAPT

Bidirectional Learning

What it measures: Both habituation (decreased response) AND sensitization (increased response).

Metric: % change in neural response over time

Scientific basis: Biological brains show both directions; direction depends on context.

Pass: Both positive and negative adaptation observed

Fail: Only one direction, OR no adaptation

Result: Both directions observed (PASS)

IQ5-EMER

Emergent Behavior

What it measures: Behaviors arise from local rules, not explicit programming.

Metric: Presence of all 4 markers above without explicit targeting

Pass: All markers emerge from architecture (no hardcoded values)

Fail: Any marker achieved through explicit programming

Result: All markers emergent (PASS)

ENT-EM5

Embodiment (5 Markers)

PURPOSE

Validates complete sensorimotor integration. A synthetic brain must process sensory input, maintain internal dynamics under load, and produce motor output in a closed loop.

EM5-BRAINBrain Criticality

Maintains critical dynamics under sensory load

EM5-VISVisual Processing

Retina input → spike encoding → cortical processing

EM5-AUDAuditory Processing

Audio input → frequency decomposition → cortical processing

EM5-MOTMotor Output

Cortical activity → actuator commands → smooth control

EM5-LOOPSensorimotor Loop

Closed-loop: sense → process → act → feedback

RESULTS

MarkerWindowsMac
EM5-BRAINPASSPASS
EM5-VISPASSPASS
EM5-AUDPASSPASS
EM5-MOTPASSPASS
EM5-LOOPPASSPASS
ENT-INTER

Interoception Benchmark

PURPOSE

Validates that synthetic brains require internal body signals (interoception) to maintain resting activity. This is a novel scientific discovery: minds need bodies.

METHODOLOGY

INTER-0: Silent State

Brain receives zero input (no external or internal signals)

Expected: DORMANT

INTER-1: Bio-Realistic State

Brain receives body signals only (interoception present)

Expected: ALIVE (criticality maintained)

RESULTS

MetricWindowsMac
INTER-0 (Silent)DORMANTDORMANT
INTER-1 (Bio-Realistic)ALIVEALIVE
Time at Criticality>95%>95%
Criticality MaintainedPASSPASS

“Embodiment isn't optional. Minds need bodies.”

Validated on both platforms.

ENT-NOVELTY

Novelty Detection Benchmark

NEW

PURPOSE

Validates that the brain can distinguish familiar from novel stimuli, demonstrate habituation to repeated input, and exhibit memory through faster recovery. This proves functional information processing, not just dynamical signatures.

SCIENTIFIC BASIS

Based on the Mismatch Negativity (MMN) paradigm from cognitive neuroscience — a gold-standard test for pre-attentive processing, working memory, and novelty detection in biological brains.

MARKERS (3)

NOV-HAB

Habituation

Brain adapts to repeated stimulus. Shows learning over time.

NOV-DET

Novelty Detection

Brain responds differently to new vs. familiar stimuli.

NOV-REC

Memory & Recovery

Brain remembers familiar stimuli. Faster re-stabilization.

RESULTS

MarkerWindows (RTX 3070)Mac (M4)
NOV-HABPASSPASS
NOV-DETPASSPASS
NOV-RECPASSPASS

Demonstrates functional information processing beyond dynamical signatures.

ENT-ASSOC

Association Learning

PURPOSE

Validates classical conditioning — the brain's ability to learn that stimulus X predicts stimulus Y. This is Pavlovian learning, the foundation of all associative reasoning.

PROTOCOL

PhaseProtocolExpected
Phase A: PairingCS (bars) → US (rings) repeatedBrain learns association
Phase B: TestCS alone (no US)Anticipatory activity
Phase C: ControlNovel stimulusNo anticipation (baseline)

MARKERS (3/3)

MarkerDescriptionWindowsMac
ASSOC-RESPStimulus response✓ PASS✓ PASS
ASSOC-ANTICAnticipatory response✓ PASS✓ PASS
ASSOC-SPECResponse specificity✓ PASS✓ PASS

Foundation for associative reasoning: A→B and B→C implies A→C.

ENT-SEQUENCE

Predictive Processing

PURPOSE

Validates temporal sequence learning and predictive processing — the brain learns sequences and generates predictions about what comes next. This is the foundation of language comprehension.

PROTOCOL

PhaseSequenceExpected
Phase A: LearningA→B→C→D repeatedBrain encodes sequence
Phase B: OmissionA→B→C→_ (D omitted)Activity at D position (prediction!)
Phase C: ViolationA→B→C→X (wrong element)Increased instability (surprise!)

MARKERS (3/3)

MarkerDescriptionWindowsMac
SEQ-ENCSequence encoding✓ PASS✓ PASS
SEQ-PREDPredictive activity✓ PASS✓ PASS
SEQ-SURPSurprise response✓ PASS✓ PASS

Activity for omitted element indicates internal prediction model.

ENT-XPLAT

Cross-Platform Invariance

PURPOSE

Validates that emergent intelligence appears on completely different hardware architectures. This proves the results come from the architecture, not platform-specific optimization.

PLATFORMS TESTED

ComponentPlatform APlatform B
GPUNVIDIA RTX 3070Apple M4
APICUDAMetal
CPUIntel x86Apple ARM
MemoryDiscrete (PCIe)Unified (SoC)
OSWindowsmacOS
Neurons470,000,0005,000,000
ENT-AUDIO

Cochlea & Auditory Pathway

PURPOSE

Validates the biological auditory pathway with frequency decomposition, temporal dynamics, and multi-pathway processing matching human cochlear function.

MARKERS (6)

AUD-FREQ

Frequency Analysis

Sound frequency decomposition

AUD-SPATIAL

Spatial Organization

Biological sound mapping

AUD-ADAPT

Temporal Adaptation

Dynamic response adjustment

AUD-TIMING

Temporal Precision

Accurate timing processing

AUD-ONSET

Onset Detection

Transient sound detection

AUD-OFFSET

Offset Detection

Sound termination detection

All 6 markers validated on both platforms

ENT-SPARSE

Efficient Neural Processing

PURPOSE

Validates efficient neural processing: only neurons that spike are processed. This mimics biological sparsity where 1-5% of neurons are active at any moment.

MARKERS (6)

SPARSE-SCALE

Efficient Scaling

Processing scales with activity level

SPARSE-CACHE

Cache Efficiency

Optimized memory access patterns

SPARSE-SPONT

Spontaneous Activity

Biological resting rate (~5Hz)

SPARSE-HIST

Activity Tracking

State history maintenance

SPARSE-HOME

Homeostatic Balance

Self-regulating activity levels

SPARSE-EVENT

Event Processing

Efficient event-driven computation

All 6 markers validated on both platforms

ENT-LANGUAGE

Complete Language Integration

PURPOSE

Validates the full language pathway through biologically-mapped language regions. This is NOT regex or pattern matching — it's neural language processing with emergent semantic representation.

MARKERS (8)

LANG-INPUT

Language Input

Text comprehension pathway

LANG-WERNICKE

Wernicke Processing

Receptive language region

LANG-SPREAD

Activity Spread

Information propagation

LANG-SEMANTIC

Semantic Clustering

Conceptual organization

LANG-BROCA

Broca Processing

Expressive language region

LANG-OUTPUT

Language Output

Coherent language generation

LANG-EMBODIED

Embodied Language

Language affects body state

LANG-LOOP

Full Loop

Complete processing cycle

Synaptic weights change based on usage. Learning through structural modification.

ENT-EMBODIED

Full Cognitive Integration

PURPOSE

Validates the complete embodied cognitive loop: perception → cognition → language → motor → feedback. All systems running in parallel, maintaining criticality under load — like a biological brain.

MARKERS (7)

EMB-PERCEPT

Perception

Visual + auditory input processing

EMB-COGNIT

Cognition

Internal state maintenance

EMB-LANG

Language

Semantic processing active

EMB-MOTOR

Motor

Action output generation

EMB-INTER

Interoception

Body signals sustaining activity

EMB-CRIT

Criticality

BR maintained under load

EMB-STABLE

Stability

Long-term operation without collapse

7/7 markers validated

All systems running simultaneously. Parallel processing like biological brains.

ENT-PARALLEL

Parallel Brain Systems (3 Markers)

PURPOSE

Validates that multiple cognitive systems operate simultaneously without interference — like a biological brain processing vision, hearing, language, and motor control in parallel.

MARKERS (3)

PAR-SIMUL

Simultaneous Operation

All systems active at once

PAR-INDEP

Independence

Systems don't block each other

PAR-CLEAN

No Interference

Cross-system crosstalk minimal

3/3 markers validated

Visual, auditory, language, and motor systems running concurrently.

ENT-SCALE

Billion-Neuron Processing (8 Markers)

PURPOSE

Validates that the architecture scales to billions of neurons while maintaining biological properties. Tests parallel processing capacity, throughput, and health metrics at unprecedented scale.

MARKERS (8)

SCALE-INJ

Injection Rate

High-throughput spike injection

SCALE-PROC

Processing Rate

Sustained processing throughput

SCALE-HEALTH

Neurological Health

Criticality maintained at scale

SCALE-CASC

Cascade Dynamics

Proper activity amplification

SCALE-ACTIVE

Active Population

Appropriate firing rates

SCALE-WORK

Working Set

Efficient memory management

SCALE-MEM

Memory Access

Optimized data transfer

SCALE-EPOCH

Epoch Handling

Stable long-duration operation

RESULTS (2B Brain)

MetricMeasurementResult
Neurological HealthTime at criticality>90%
All 8 MarkersPass/Total8/8 PASS

8/8 markers validated at 2 billion neurons

First demonstration of emergent intelligence at this scale on consumer hardware.

ENT-PLASTICITY

Structural Plasticity (5 Markers)

PURPOSE

Validates biological learning cycle — new connections form during activity, consolidate during rest. This is how the brain learns without training loops.

PLAST-FORM

Synapse Formation

New connections form based on correlated activity

PLAST-STRENGTH

Connection Strengthening

Active pathways become stronger over time

PLAST-HOME

Homeostatic Regulation

Self-regulation maintains stable activity levels

PLAST-PRUNE

Synaptic Pruning

Unused connections removed during consolidation

PLAST-PHYS

Physics-Based Formation

Emergent principles drive synapse creation

RESULTS

MarkerMeasurementResult
PLAST-FORMNew synapses formed during learningPASS
PLAST-STRENGTHWeight changes observedPASS
PLAST-HOMECriticality maintained during learningPASS
PLAST-PRUNEWeak connections removedPASS
PLAST-PHYSEmergent formation dynamicsPASS

5/5 markers validated

Learning through structural modification. No gradient descent. No backpropagation.

ENT-COGNITIVE

Physics-Based Learning (20 Markers)

NEW

PURPOSE

Validates cognitive learning through physics alone. No rewards. No labels. No backpropagation. All learning emerges from exposure and synaptic plasticity.

LEVELS (5)

ENT-PERCEPTLevel 1: Perceptual Learning4/4

Pattern discrimination, sequence learning, habituation, association.

ENT-RELATIONLevel 2: Relational Learning4/4

Repetition suppression, oddball detection, rule extraction, interval learning.

ENT-WORKINGLevel 3: Working Memory4/4

Persistence via recurrence, recognition, sequence encoding, interference resistance.

ENT-ADAPTIVELevel 4: Adaptive Behavior4/4

Reversal via decay, extinction, context-dependent processing, metaplasticity.

ENT-TRANSFERLevel 5: Transfer & Generalization4/4

Structural generalization, prototype formation, compositional binding, temporal abstraction.

DETAILED RESULTS

LevelKey MeasurementResultStructural Learning
L1 PerceptualPattern discriminationPASSSignificant
L2 RelationalSame/different detectionPASSSignificant
L3 Working MemoryPersistence windowsPASSSignificant
L4 AdaptiveMetaplasticity (BCM)PASSMinimal
L5 TransferUnseen prototypePASSMinimal
Total20/20 PASSObserved

20/20 cognitive markers validated

Structural learning observed across all levels

Learning through exposure and synaptic plasticity. No rewards. No labels.

Key Discoveries from Cognitive Tests

ENT-TRANSFER / L5

Prototype Formation

Exposed brain to exemplars E1-E10 around prototype P.
Prototype P was never shown during exposure.

Response to unseen P

CRITICAL

Emergent statistical learning. The brain formed the central tendency without explicit training.

ENT-ADAPTIVE / L4

BCM Metaplasticity

Plasticity rate depends on activity history.
High activity → reduced subsequent plasticity.

Baseline

High

Saturated

Reduced

Rested

Recovered

Significant reduction after saturation. First large-scale demonstration.

ENT-SEQUENCE

Predictive Processing

Trained on sequence A→B→C→D.
Presented A→B→C→[nothing].

Activity at position D (nothing shown)

Critical (predicted)

The brain predicted an element that was never displayed.

ENT-INTER

Embodiment Dependency

Compared brain with and without internal body signals.

No body signals

DORMANT

With body signals

CRITICAL

Minds need bodies.

ENT-TOTAL

Complete Validation Score

BenchmarkMarkersWindowsMac
ENT-SPEED3 processing speed markers3/33/3
ENT-IQ55 intelligence markers5/55/5
ENT-EM55 embodiment markers5/55/5
ENT-INTER5 interoception markers5/55/5
ENT-ASSOC3 association learning markers3/33/3
ENT-SEQUENCE3 predictive processing markers3/33/3
ENT-NOVELTY3 memory/recognition markers3/33/3
ENT-AUDIO6 auditory pathway markers6/66/6
ENT-SPARSE6 efficient processing markers6/66/6
ENT-LANGUAGE8 language integration markers8/88/8
ENT-EMBODIED7 cognitive integration markers7/77/7
ENT-PARALLEL3 parallel processing markers3/33/3
ENT-SCALE8 billion-neuron markers8/8N/A (2B only)
ENT-PLASTICITY5 structural plasticity markers5/55/5
ENT-XPLAT11 hardware invariance markers11/11 cross-validated
ENT-PERCEPT4 perceptual learning markers (L1)4/44/4
ENT-RELATION4 relational learning markers (L2)4/44/4
ENT-WORKING4 working memory markers (L3)4/44/4
ENT-ADAPT4 adaptive behavior markers (L4)4/44/4
ENT-TRANSFER4 generalization markers (L5)4/44/4
ENT-TOTALAll validation markers101/101101/101

101/101

Three scales validated: 5M → 470M → 2B neurons

Including 20/20 physics-based cognitive tests (L1-L5)

Same architecture. Same emergence. Any scale.

2 BILLION NEURON VALIDATIONNEW

Neurons

1,999,824,000

Speed

Faster than biological

Markers

101/101 PASS

20 benchmark categories • 5 cognitive test levels (L1-L5) • RTX 3070 ($500 GPU)

ENT-FALSIFY

What Would Disprove This

Science doesn't prove things true. It rules out alternatives. Here are the falsification criteria and actual results.

ClaimWould Falsify IfActual ResultStatus
Faster than brainHz/neuron < 1028-915 HzNot falsified
Self-organizationBR never 0.7-1.3Converges → 1.0Not falsified
Non-deterministicCV < 1%CV >> 1%Not falsified
Bidirectional learningOne direction onlyBoth directionsNot falsified
Hardware invariantOne platform failsBoth 101/101Not falsified
Zero trainingTraining code foundNone existsNot falsified
Embodiment requiredActive with zero inputGoes dormantNot falsified
Active sensorsAny sensor = 0All > 0Not falsified

8 falsification tests. 0 falsified.

Tested across 94× scale difference, two GPU vendors, two memory architectures, two operating systems.

Benchmark Suite Evolution

Development history of the Entropis Benchmark Suite:

v2.2Current101 markers. 20/20 cognitive tests (L1-L5). Hardware invariance validated.
v2.0Jan 18Physics-based cognitive tests. ENT-PLASTICITY. ENT-COGNITIVE.
v1.3Jan 172 billion neuron validation. ENT-EMBODIED (7 markers).
v1.1Jan 16ENT-NOVELTY. ENT-ASSOC. ENT-SEQUENCE. Memory and prediction.
v1.0Jan 15Initial suite. ENT-IQ5, ENT-EM5, ENT-INTER. Core validation.

Each version adds markers while maintaining backward compatibility with prior validations.

Verify Live

Request a live demonstration to observe benchmark tests in real-time.

Request Demo

NDA required for live demonstration.