ESTUDIO DE INGENIERÍA DE IA · MADRID · SECTORES REGULADOS AI ENGINEERING STUDIO · MADRID · REGULATED SECTORS

Ingeniería de IA
responsable, escalable
y medible. AI engineering
responsible, scalable
and measurable.

La arquitectura sigue al caso, no al ciclo de hype. Diseño y despliego sistemas de IA local-first cuando los datos no pueden salir del edificio — y cloud responsable en GCP o Azure cuando la escala global lo justifica. Una arquitectura, dos planos, una sola disciplina de ingeniería. Architecture follows the case, not the hype cycle. I design and deploy local-first AI systems when data cannot leave the building — and responsible cloud on GCP or Azure when global scale justifies it. One architecture, two planes, a single engineering discipline.

Consultoría especializada en Gobierno de Datos y AI Compliance para despachos, clínicas y fintechs operando bajo RGPD, secreto profesional sectorial y AI Act UE. Arquitecturas local-first auditables y pipelines GCP/Azure que documentan su superficie regulatoria desde la primera línea de código. Specialised consultancy in Data Governance and AI Compliance for law firms, clinics and fintechs operating under GDPR, sectoral professional secrecy and the EU AI Act. Auditable local-first architectures and GCP/Azure pipelines that document their regulatory surface from the first line of code.

Reservar consultoría Book consultation Ver servicios See services Leer principios → Read principles →

EmilianoVolpi.tox

typeFounder · AI Engineer

companyDigital Realism

locationMadrid · ES

roleData Engineer @ Accenture

stack_localLlama 4 Scout · Llama 3.3 70B · Qwen 3.5 · MLX

stack_cloudGemini · Vertex AI · Azure AI

stack_avTouchDesigner · Ableton · VCV

td_mcp→ Claude controls TD natively

CH 01 · LOCAL_COMPUTE

128GB

RAM unificada para
modelos frontera locales Unified memory for
local frontier models

CH 02 · CLOUD_CERTS

8×

certificaciones cloud activas
GCP · Azure · Databricks · Oracle active cloud certifications
GCP · Azure · Databricks · Oracle

CH 03 · INFERENCE_FOOTPRINT

~30W

consumo medio inferencia local
vs racks GPU completos average local inference draw
vs full GPU racks

CH 04 · MODELS_ORCHESTRATED

10×

modelos en producción
local + cloud híbrido models in production
local + hybrid cloud

SERVICIOS · NETWORK_VIEW SERVICES · NETWORK_VIEW

Siete operadores, una arquitectura coherente. Seven operators, one coherent architecture.

Cada propuesta combina ingeniería rigurosa con criterio práctico. La mejor solución es la que tu equipo entiende, mantiene y mide en resultados de negocio. Every proposal combines rigorous engineering with practical judgement. The best solution is the one your team understands, maintains, and measures in business outcomes.

[ 01 · INSIGNIA ]DAT · LOCAL

LegalVault.dat

Sistema RAG aislado por cliente para sectores regulados (legal, sanidad, finanzas). Indexa expedientes, jurisprudencia y plantillas internas; cada respuesta cita el documento fuente. Cumplimiento RGPD y secreto profesional by architecture, no por NDA. Despliegue local en Mac Mini servidor, M5 Max o, opcionalmente, en VPC privada GCP o Azure. Per-client isolated RAG system for regulated sectors (legal, healthcare, finance). Indexes case files, jurisprudence, and internal templates; every response cites its source document. GDPR and professional secrecy by architecture, not by NDA. Deploys on Mac Mini server, M5 Max, or optionally on GCP/Azure private VPC.

DEPLOYMENT: ON-PREMISE · OPCIONAL CLOUD VPC

AnythingLLMLlama 4 ScoutLlama 3.3 70BGLM-OCRChromaDBLlama Guard 3Vertex AI Search

[ 02 · A MEDIDA ]CHOP · HÍBRIDO

Agents.chop

Agentes IA type-safe en producción con memoria persistente, tests y observabilidad. Modelo agnóstico: ejecutan sobre Ollama local, Gemini, Claude o Vertex AI Agent Builder según el caso. Casos: clasificación documental, due diligence, atención cliente, automatización browser. Type-safe production AI agents with persistent memory, tests, and observability. Model-agnostic: run on local Ollama, Gemini, Claude, or Vertex AI Agent Builder per case. Use cases: document classification, due diligence, customer support, browser automation.

DEPLOYMENT: LOCAL · CLOUD · MIXED

Pydantic AILangGraphDeep AgentsBrowser UseMem0Langfuse

[ 03 · CLOUD ]TOP · GCP / AZURE

CloudEngine.top

Despliegues productivos serverless en Google Cloud y Azure con Gemini 2.5, Vertex AI Agent Builder, Cloud Run y Cloud Functions. La opción cuando el caso pide escala global o el cliente ya opera en cloud. Certificaciones activas: Professional Data Engineer + GenAI Leader + Cloud Digital Leader + Azure AI Fundamentals. Serverless production deployments on Google Cloud and Azure with Gemini 2.5, Vertex AI Agent Builder, Cloud Run, and Cloud Functions. The option when the case calls for global scale or the client already operates in cloud. Active credentials: Professional Data Engineer + GenAI Leader + Cloud Digital Leader + Azure AI Fundamentals.

DEPLOYMENT: GCP · AZURE · MULTI-CLOUD

Gemini 2.5Vertex AICloud RunCloud FunctionsBigQueryAzure AI Foundry

[ 04 · OPERACIONES ]COMP · SELF-HOSTED

Hyperflow.comp

Procesos manuales que dejan de serlo: extracción documental con OCR, integraciones cross-stack, automatizaciones web y workflows IA. Self-hosted por defecto (sin límites de ejecución ni costes por nodo); también deployable en cloud cuando se requiere alta disponibilidad gestionada. ROI medible en 90 días. Manual processes that stop being manual: document OCR extraction, cross-stack integrations, web automations, and AI workflows. Self-hosted by default (no execution limits or per-node costs); also deployable in cloud for managed high availability. Measurable ROI in 90 days.

DEPLOYMENT: DOCKER · KUBERNETES · CLOUD RUN

n8n self-hostedBrowser UsePlaywrightMCP serversComposio

[ 05 · VERTICAL ]SOP · VISION

Vision.sop

Computer Vision aplicada a problemas verticales: imagen biomédica (ultrasonido, radiología), object detection en imágenes satelitales, video analytics. Modelos custom (ResNet, U-Net, YOLO26) o LLMs multimodales (Gemini Vision, Claude Vision, Qwen3-VL). Experiencia académica con Premio al Mejor TFM por detección de sarcopenia mediante segmentación de ultrasonido. Computer Vision applied to vertical problems: biomedical imaging (ultrasound, radiology), object detection on satellite imagery, video analytics. Custom models (ResNet, U-Net, YOLO26) or multimodal LLMs (Gemini Vision, Claude Vision, Qwen3-VL). Academic background includes Best Thesis Award for sarcopenia detection via ultrasound segmentation.

DEPLOYMENT: ON-PREMISE · GCP VERTEX AI · AZURE

PyTorch MPSResNet-50YOLO26Gemini VisionOpenCVCoreML

[ 06 · INSTALACIONES AV ]TOX · LIVE

Stage.av

Instalaciones interactivas, VJ sets y experiencias audio-reactivas con TouchDesigner orquestado por Claude vía MCP server. AI Director (qwen3.5 cada 30s) ajusta parámetros en vivo. Voz sintética propia con VoiceBox + Qwen3-TTS. Para eventos corporativos, museos, marcas que buscan algo más allá del PowerPoint. Interactive installations, VJ sets, and audio-reactive experiences with TouchDesigner orchestrated by Claude via MCP server. AI Director (qwen3.5 every 30s) adjusts parameters live. Custom synthetic voice with VoiceBox + Qwen3-TTS. For corporate events, museums, and brands looking beyond PowerPoint.

DEPLOYMENT: ON-SITE · LIVE · NDI / SYPHON

TouchDesignerTD MCPAbleton LiveVoiceBoxQwen3-TTSRemotion

[ 07 · ADVISORY ]MAT · SESSIONS

AdvisoryBoard.mat

Workshops para equipos técnicos, auditoría de iniciativas IA existentes, hojas de ruta realistas para integrar IA sin caer en el hype. Sesiones presenciales en Madrid o remotas. Diseñado para CTOs, VP Engineering y comités de innovación. Workshops for technical teams, audits of existing AI initiatives, realistic roadmaps to integrate AI without hype. In-person in Madrid or remote sessions. Designed for CTOs, VP Engineering, and innovation committees.

Solicitar agenda → Request schedule →

ARQUITECTURAS · DEPLOYABLE_PATTERNS ARCHITECTURES · DEPLOYABLE_PATTERNS

Cuatro patrones probados en producción. Four patterns proven in production.

No vendo IA genérica: vendo arquitecturas con justificación técnica clara y métricas concretas. Estos cuatro patrones cubren el 90% de los casos reales de un cliente B2B en sectores regulados o intensivos en datos. I don't sell generic AI: I sell architectures with clear technical justification and concrete metrics. These four patterns cover 90% of real cases for a B2B client in regulated or data-intensive sectors.

[ A · LOCAL-FIRST ]RGPD · Art. 9.2.h

Soberanía total. Cero red exterior.Total sovereignty. Zero external network.

Para casos donde el dato no puede salir del edificio: despachos, clínicas, defensa. Mac Mini M4/M5 servidor o M5 Max ejecutando modelos open-weight (Llama 4 Scout, Llama 3.3 70B). Cero llamadas a internet en path crítico. El hardware escala bajo pedido — desde un único Mac Mini M4 hasta un cluster con Tailscale mesh. For cases where data cannot leave the building: law firms, clinics, defence. Mac Mini M4/M5 server or M5 Max running open-weight models (Llama 4 Scout, Llama 3.3 70B). Zero internet calls in critical path. Hardware scales on demand.

DICOM/PDF → OCR (GLM) → Embeddings → ChromaDB ↓ Llama 4 Scout / Llama 3.3 70B (Ollama) ↓ Streamlit / Hono local · Visor en LAN (Llama Guard 3 → Langfuse audit trail)

Latencia p50~400ms Energía~30W Casos realesLegalVault · Hygeia Escala1 → N nodos · Tailscale mesh

[ B · HYBRID ]VPC · multi-region

Lo sensible local. Lo escalable cloud.Sensitive local. Scalable cloud.

Cuando el cliente quiere escalar sin comprometer aislamiento. Capa local en Mac Mini para PII; capa Vertex AI Agent Builder + Cloud Run para razonamiento sobre datos anonimizados. La frontera entre los dos planos queda documentada y auditable. When the client needs to scale without compromising isolation. Local Mac Mini layer for PII; Vertex AI Agent Builder + Cloud Run layer for reasoning over anonymised data. The boundary between the two planes is documented and auditable.

Edge gateway local (PII redaction) ↓ datos anonimizados VPC privado GCP (europe-southwest1) ↓ Vertex AI Agent Builder + Cloud Run + BigQuery ↓ Resultado regresa al gateway → cliente (zero raw PII en cloud · DPIA documentado)

Throughput~500 q/s Coste/100 usuarios~600€/mes Casos realesDeltaEngine pattern ComplianceRGPD + AI Act

[ C · CLOUD-NATIVE ]Serverless · GCP

RAG productivo en Cloud Run.Production RAG on Cloud Run.

Cuando el caso es público o con datos no sensibles. Patrón de dos pasos — indexing offline + serving lightweight — que evita cold starts y mantiene coste sub-céntimo por query. When the case is public or non-sensitive. Two-step pattern — offline indexing + lightweight serving — that avoids cold starts and keeps cost sub-cent per query.

[Step 1 · indexing offline] Cloud Storage → Cloud Run job → Embeddings → Vertex AI Search index [Step 2 · serving] Hono.js Edge → Cloud Run → Gemini 2.5 → Vertex AI Search retrieval → response (streaming SSE)

Cold start<500ms Coste / 1K queries$0.30 Casos realesAttentionRAG Escalaglobal · auto

[ D · EDGE-AI ]Cloudflare · sub-100ms

Inferencia en el edge global.Inference at the global edge.

Cuando latencia < 200ms es requisito. Modelos pequeños (Phi-4, Gemma 2) corriendo en Workers AI o Cloudflare AI; fallback a Groq cuando el caso pide modelo más grande pero igual de rápido. When latency < 200ms is a requirement. Small models (Phi-4, Gemma 2) running on Workers AI or Cloudflare AI; Groq fallback when the case calls for a bigger but equally fast model.

User → Cloudflare Worker (Hono.js) ↓ Workers AI (Llama 3.2 1B) → TTFT ~80ms ↓ (escalación condicional) Groq Cloud (Llama 4 Scout 800 t/s) ↓ Streaming SSE → user

TTFT p50~80ms Coste / 1K queries~$0.01 Casos realesdemo chat de este sitio Escalaglobal · 300+ POPs

La arquitectura adecuada no se imagina — se despliega, se audita, se mantiene. Cada uno de estos cuatro patrones tiene métricas medibles, no promesas. The right architecture is not imagined — it is deployed, audited, maintained. Each of these four patterns has measurable metrics, not promises. — PRINCIPIO RECTOR · DIGITAL REALISM

DEMO · CONSULT_AGENT() DEMO · CONSULT_AGENT()

Pregunta al agente.
Lo que ves es lo que el cliente experimenta. Ask the agent.
What you see is what the client experiences.

Esta conversación se ejecuta sobre Llama 4 Scout / Llama 3.3 70B corriendo localmente en mi MacBook Pro M5 Max. Sin tokens cloud, sin latencia de red, sin datos cruzando internet del lado del cliente. Es exactamente la misma stack que tus usuarios usarían en producción sobre un Mac Mini servidor en tu sede. This conversation runs over Llama 4 Scout / Llama 3.3 70B locally on my MacBook Pro M5 Max. No cloud tokens, no network latency, no client data crossing the internet. It's exactly the same stack your users would use in production on a Mac Mini server at your office.

Sustituyendo el modelo por Gemini 2.5 Pro o Claude Opus, el mismo flujo se despliega en GCP/Vertex AI cuando el caso pide escala global. Swap the model for Gemini 2.5 Pro or Claude Opus and the same flow deploys on GCP/Vertex AI when the case calls for global scale.

Llama 4 ScoutLlama 3.3 70BChromaDBPydantic AIStreaming SSE

DigitalRealism.agent OPERATIONAL MODEL: Llama 4 Scout LOCAL

TRABAJO REALIZADO · CASES SHIPPED WORK · CASES

Lo que ya he construido. What I've already built.

Selección de proyectos open-source desde mi TFM premiado en Universidad Complutense hasta despliegues activos en Google Cloud. Cada caso enlaza a su repositorio cuando aplica. A selection of open-source projects, from my award-winning M.Sc. thesis at Universidad Complutense to active deployments on Google Cloud.

[ HEALTHCARE · CV ]SOP

★ PREMIO MEJOR TFM · UCM 2024

Hygeia.cv

Detección y monitorización de sarcopenia mediante segmentación de imágenes de ultrasonido con redes neuronales profundas (ResNet-50 + U-Net). Aplicación clínica que asiste a profesionales sanitarios en diagnóstico y seguimiento. AUC 0.93 sobre holdout. Sarcopenia detection and monitoring via ultrasound image segmentation with deep neural networks (ResNet-50 + U-Net). Clinical application assisting healthcare professionals. AUC 0.93 on holdout.

PyTorch MPS · ResNet-50 · U-Net · OpenCV

[ HEALTHCARE · LLM ]DAT

Hygeia2.rag

Asistente conversacional multi-dominio basado en LLaMA 3.1 (8B) con fine-tuning supervisado (LoRA + cuantización 4-bit). Soporte para profesionales de biomedicina, IA y medicina deportiva. Multi-domain conversational assistant based on LLaMA 3.1 (8B) with supervised fine-tuning (LoRA + 4-bit quantisation). Supports biomedicine, AI, and sports medicine professionals.

LLaMA 3.1 · LoRA · PEFT · Streamlit

[ GCP · RAG ]TOP

AttentionRAG.cloud

Asistente RAG experto en el paper "Attention Is All You Need" desplegado en producción en Google Cloud. Arquitectura de dos pasos que resuelve el problema de cold start en serverless. RAG assistant specialised in "Attention Is All You Need" deployed in production on Google Cloud. Two-step architecture that solves the serverless cold-start problem.

Gemini 2.5 · LangChain · Cloud Run · GCS

[ GCP · AGENT ]CHOP

CloudSpec.ai

Agente multimodal que convierte sketches de pizarra y prompts en lenguaje natural en especificaciones de arquitectura cloud GCP, con pricing en tiempo real. Server-Driven UI vía protocolo A2UI. Multimodal agent that converts whiteboard sketches and natural-language prompts into GCP cloud architecture specs, with real-time pricing.

Vertex AI Gemini 2.0 · FastAPI · Cloud Run

[ ENERGY · CV ]SOP

SolarDetect.cv

Detección automática de paneles fotovoltaicos sobre imágenes satelitales de tejados (ResNet-34 fine-tuned). Aplicación: mapeo energético urbano y planificación de instalaciones renovables. Automatic photovoltaic panel detection on rooftop satellite imagery (ResNet-34 fine-tuned). Application: urban energy mapping and renewable installation planning.

ResNet-34 · Object Detection · fast.ai

[ NLP · DATA ]MAT

SentimentMusic.dat

Análisis de sentimiento sobre Billboard Hot 100 (29.503 canciones) cruzando audio features de Spotify para clasificar tracks en categorías emocionales mediante queries MongoDB. Sentiment analysis over Billboard Hot 100 (29,503 songs) cross-referencing Spotify audio features to classify tracks into emotional categories via MongoDB queries.

MongoDB · Aggregation Pipeline · Audio Features

[ ENTERPRISE · DATA ENGINEERING ]TOP

★ EXPERIENCIA ACTIVA · NDAACTIVE EXPERIENCE · NDA

DeltaEngine.gcp

Motores de procesamiento incremental near-real-time sobre GCP en sectores de banca y energía. CDC desde NoSQL hacia BigQuery, patrones HOT/COLD con idempotencia estricta. Trabajo en mi rol como Data Engineer en Accenture: detalles bajo NDA. Near-real-time incremental processing engines on GCP in banking and energy sectors. CDC from NoSQL to BigQuery, HOT/COLD patterns with strict idempotency. Developed as Data Engineer at Accenture: details NDA-protected.

BigQuery · Cloud Functions Gen2 · Pub/Sub · Terraform

COMPLIANCE · CERTIFICATIONS_BOARD COMPLIANCE · CERTIFICATIONS_BOARD

Ingeniería certificada. Datos bajo control. Certified engineering. Data under control.

La innovación técnica sin rigor regulatorio es deuda escondida. Cada arquitectura entregada viene respaldada por certificaciones activas en GCP, Azure y Databricks. Technical innovation without regulatory rigour is hidden debt. Every architecture shipped is backed by active certifications in GCP, Azure and Databricks.

Google CloudACTIVE

Professional Data Engineer

SEP 2023 — SEP 2027RECERTIFIED

Diseño de sistemas de procesamiento de datos a escala en GCP: BigQuery, Dataflow, Pub/Sub, Cloud Run. Foco en pipelines fiables, seguros y normativamente compliant.

Google CloudACTIVE

Generative AI Leader

NOV 2025 — NOV 2028STRATEGIC

Dirección estratégica de iniciativas GenAI corporativas. Evaluación de casos de uso, mitigación de riesgos (alucinaciones, sesgo, fuga de datos) y diseño para ROI medible.

Google CloudACTIVE

Cloud Digital Leader

OCT 2025 — OCT 2028FOUNDATIONAL

Visión transversal del ecosistema Google Cloud: mapeo de necesidades de negocio a productos cloud, casos de adopción y estrategia de modernización empresarial.

Accenture × Stanford HAIACTIVE

Reinvention with Agentic AI

DEC 2025EXECUTIVE

Programa ejecutivo en agentic AI con material licenciado de Stanford HAI. Frameworks de adopción corporativa, casos productivos y reinvención de procesos con autonomía controlada.

MicrosoftACTIVE

Azure AI Fundamentals

MAR 2026AI-900

Cargas de ML e IA sobre Azure: ML Studio, Cognitive Services, Azure AI Foundry. Ideal para clientes con stack Microsoft establecido o requisitos de soberanía europea Azure.

DatabricksACTIVE

Data Intelligence Fundamentals

MAY 2025ACCREDITED

Lakehouse, Delta Lake, Unity Catalog y MLflow. Arquitecturas RAG productivas y vector search dentro del ecosistema Data Intelligence Platform.

OracleACTIVE

OCI Foundations Associate

SEP 2024OCI 2024

Fundamentos de Oracle Cloud Infrastructure: cómputo, almacenamiento, redes y seguridad. Cobertura para clientes regulados con stack Oracle establecido.

Coursera × StanfordVERIFIED

AI for Medicine

OCT 20243-COURSE SPEC.

Especialización Stanford de tres cursos en aplicación de ML a problemas médicos: diagnóstico (radiografías, resonancias), pronóstico clínico y estimación del efecto de tratamientos.

Premio al Mejor TFM · Universidad Complutense de Madrid

Trabajo de Fin de Máster premiado en Data Science, Big Data & Business Analytics (UCM 2024) por Hygeia — sistema de detección de sarcopenia mediante segmentación de imágenes de ultrasonido con redes neuronales profundas. Calificación 9,3/10 · Promoción 2023-2024 presencial.

NTIC MASTER × UCM
FEB · 2025

● Todas las credenciales son verificables públicamente vía Credly · Google Cloud · Microsoft Learn. Copias originales en PDF disponibles bajo petición a emilianovolpi@digitalrealism.io.

Soberanía de datos por diseño. Data Sovereignty & Regulatory Posture.

Tus datos viven donde tú decides, no donde es cómodo para el proveedor de turno. Cuando el caso lo permite, el cómputo se queda en tu edificio. Cuando exige escala, lo desplegamos en cloud privada con la misma calidad de ingeniería. Diseñamos para clientes en sectores regulados (legal, sanidad, finanzas) donde el dato no puede abandonar la jurisdicción del cliente. Cada despliegue documenta su superficie de cumplimiento.

[ p01 ] DATA RESIDENCY

Despliegues on-premise o en VPC privada con regiones controladas. Trazabilidad documental de cada copia y derivado del dato fuente.

[ p02 ] AUDIT TRAIL

Langfuse self-hosted en infraestructura del cliente. Cada inferencia queda registrada y es auditable a perpetuidad.

[ p03 ] AI SAFETY GATE

Llama Guard 3 actuando como filtro previo a cualquier salida del agente. PII detection, toxicity scoring y bloqueo configurable.

FOUNDER · PORTRAIT_NODE FOUNDER · PORTRAIT_NODE

EmilianoVolpi.portrait

Emiliano Volpi — Founder & Principal Engineer, Digital Realism

● CAPTURED · MADRID 2026 v2026.05

Una agencia es una persona con un criterio.Empresa unipersonal con acreditaciones corporativas verificables y experiencia enterprise activa. An agency is one person with judgement.A solo studio with verifiable corporate credentials and active enterprise experience.

Soy Emiliano Volpi. Diseño y construyo sistemas de IA en producción desde 2024, con un foco editorial en lo que sirve realmente al cliente: arquitecturas que duran, que se auditan, y que respetan el dato como activo crítico.

Trabajo de día como Data Engineer en proyectos enterprise de banca y energía sobre Google Cloud Platform — pipelines críticos, FinOps disciplinado, observabilidad de datalakes Tier-1 con BigQuery, Cloud Functions, Datastream y Terraform.

Antes: Máster en Data Science con Premio al Mejor TFM (UCM 2024), computer vision biomédica para detección de sarcopenia, pre-sales cloud en Telefónica Tech.

Founder & Principal Engineer en Digital Realism. Actualmente Data Engineer & Governance Analyst en Accenture (Madrid) operando proyectos enterprise sobre Google Cloud Platform en sectores de banca y energía.

Acreditaciones activas en las tres principales nubes (GCP × 3, Azure AI Fundamentals, Oracle OCI) más Databricks y formación ejecutiva certificada en Agentic AI por Accenture × Stanford HAI. Calificación 9,3/10 en Máster UCM con Premio al Mejor TFM (2024).

I'm Emiliano Volpi. I've been designing and building production AI systems since 2024, with an editorial focus on what actually serves the client: architectures that last, that audit cleanly, and that treat data as a critical asset.

By day I work as a Data Engineer on enterprise projects in banking and energy on Google Cloud Platform — critical pipelines, disciplined FinOps, Tier-1 datalake observability.

Founder & Principal Engineer at Digital Realism. Currently Data Engineer & Governance Analyst at Accenture (Madrid) operating enterprise projects on Google Cloud Platform in banking and energy sectors.

FounderEmiliano Luis Volpi

Día actualDay roleData Engineer @ Accenture · Madrid

EstudioStudioDigital Realism · founder

BaseBased inMadrid · ES · UTC+1

LinkedIn/emiliano-volpi

Línea directaDirect lineemilianovolpi@digitalrealism.io

PHILOSOPHY · ENGINEERING_PRINCIPLES

Principios del Realismo Digital. Principles of Digital Realism.

«Realismo digital: lo que pasa de la ficción a la realidad cuando se mide. La arquitectura adecuada no se imagina — se despliega, se audita, se mantiene.» "Digital realism: what crosses from fiction to reality when it gets measured. The right architecture is not imagined — it is deployed, audited, maintained."

— PRINCIPIO RECTOR · v2026.05 — GUIDING PRINCIPLE · v2026.05

Cada decisión técnica deja huella — energética, regulatoria, contractual. Estos principios articulan los criterios de ingeniería del estudio: no son un manifiesto político, son las restricciones reales con las que se firma cada arquitectura. Every technical decision leaves a trace — energetic, regulatory, contractual. These principles articulate the studio's engineering criteria: not a political manifesto but the actual constraints under which every architecture is signed.

[ p01 · FIELDS OF SENSE ]

La arquitectura sigue al campo de sentido, no al ciclo de hype. Architecture follows the field of sense, not the hype cycle.

No existen soluciones universales porque "el mundo" no existe como totalidad. Local-first cuando el dato es sagrado; cloud cuando la escala es el fin. El criterio se firma; la moda no. There are no universal solutions because "the world" does not exist as a totality. Local-first when data is sacred; cloud when scale is the goal. Criteria get signed; trends don't.

Markus GabrielP. Data Engineer

[ p02 · DOCUMENTALITY ]

El dato es una inscripción social inemendable. Data is an unamendable social inscription.

Un contrato o historial clínico es una responsabilidad antes que un activo. Priorizamos arquitecturas que blinden la documentalidad: el control efectivo del registro reside en el generador, no en el proveedor. A contract or medical record is a liability before it is an asset. We prioritize architectures that shield documentality: effective control of the record stays with the generator, not the vendor.

Maurizio FerrarisRGPD by Design

[ p03 · ENERGY AS RESISTANCE ]

Cada vatio es resistencia material medida. Every watt is measured material resistance.

La realidad es física. Un Mac Mini a 30W ejecutando modelos locales no es una opción ideológica: es resistencia contra el consumo extractivo. Hardware es destino. Reality is physical. A 30W Mac Mini running local models is not an ideological choice: it is resistance against extractive consumption. Hardware is destiny.

Yuk HuiM5 Max Performance

[ p04 · WHITE-BOX AUDIT ]

Trazabilidad contra la algocracia. Traceability against algocracy.

Contra la caja negra, implementamos observabilidad total. Cada decisión de la IA es un documento auditable que el profesional puede validar. Lo que no se mide se pierde; lo que se audita se defiende. Against the black box, we implement total observability. Every AI decision is an auditable document that the professional can validate. What isn't measured gets lost; what is audited can be defended.

Éric SadinAudit Trail

[ p05 · DE-PROLETARIZATION ]

Sistemas que devuelven el saber-hacer. Systems that return know-how.

La IA debe ampliar el juicio humano, no sustituirlo. Diseñamos arquitecturas mantenibles con Open Source para evitar la alienación cognitiva del cliente. Si no puedes mantenerlo, no eres libre. AI should expand human judgment, not replace it. We design maintainable architectures with Open Source to avoid client cognitive alienation. If you can't maintain it, you're not free.

Bernard StieglerOpen Source

[ p06 · SOVEREIGN TIME ]

Des-cancelar el futuro: ecología del tiempo. Un-canceling the future: ecology of time.

Contra la parálisis del "Realismo Capitalista", usamos IA local no para acelerar la fatiga, sino para absorber el back-office. Automatizar es una decisión política: sirve para devolverle al profesional su tiempo soberano. Against the paralysis of "Capitalist Realism", we use local AI not to accelerate fatigue, but to absorb back-office tasks. Automating is a political decision: it returns sovereign time to the professional.

Mark FisherAndré Gorz

[ p07 · MACHINE OF PERCEPTION ]

Rediseñar el reparto de lo sensible. Redesigning the distribution of the sensible.

La Visión Computacional no solo calcula; percibe. Al detectar patrones subclínicos o legales antes que el ojo humano, reconfiguramos lo que es visible en una organización. Hacemos de la IA una lente, no un reemplazo. Computer Vision doesn't just calculate; it perceives. By detecting patterns before the human eye, we reconfigure what is visible in an organization. We make AI a lens, not a replacement.

Gilles DeleuzeJacques Rancière

[ p08 · ANTI-NATURALISM ]

Automatización emancipadora y Xenofeminismo. Emancipatory automation and Xenofeminism.

Rechazamos el determinismo biológico. La tecnología es un campo de batalla que debe ser reprogramado para abolir los binarios opresivos y sostener infraestructuras de cuidado no normativas. We reject biological determinism. Technology is a battlefield that must be reprogrammed to abolish oppressive binaries and sustain non-normative infrastructures of care.

Helen HesterLaboria Cuboniks

[ p09 · INTERPASSIVITY ]

Romper el cinismo: inscribir el "NO" ético. Breaking cynicism: inscribing the ethical "NO".

La tecnología nos vuelve "interpasivos": la máquina actúa y decide moralmente por nosotros. Para recuperar el deseo de construir, inscribimos la prohibición ética en la base de datos. Technology makes us "interpassive": the machine acts and makes moral decisions for us. To recover the desire to build, we inscribe ethical prohibition into the database.

Slavoj ŽižekRobert Pfaller

[ p10 · HYPERSTITION ]

Ingeniería iterativa y contención de riesgos. Iterative engineering and risk containment.

El código hace real lo que enuncia (Hipersticición). Pero ante inteligencias agénticas, exigimos contención. Desplegamos iterativamente con guardarraíles estrictos. Ship. Iterate. Measure. Code makes real what it states (Hyperstition). But facing agentic intelligences, we demand containment. We deploy iteratively with strict guardrails.

Kent BeckMustafa Suleyman

Versión extendida de estos diez principios — con su linaje crítico completo y autores referenciados. Lo que aquí cabe en diez tarjetas, allí se despliega en diez ensayos: Extended version of these ten principles — with their full critical lineage and referenced authors. What fits here in ten cards, unfolds there in ten essays:

Realismo Digital · los diez principios extendidos Digital Realism · the ten extended principles digitalrealism.art/manifesto.html

STUDIO · HARDWARE_PATCH STUDIO · HARDWARE_PATCH

El estudio que ejecuta todo lo anterior. The studio that runs all of the above.

El setup real desde el que se diseñan, prueban y entregan los proyectos. M5 Max 128GB ejecutando modelos frontera localmente, hardware musical conectado por audio-rate, TouchDesigner controlado por Claude vía MCP. The actual setup where projects are designed, tested and shipped. M5 Max with 128GB running frontier models locally, musical hardware at audio-rate, TouchDesigner controlled by Claude via MCP.

[ 01 ]SIGNAL INPUTS · MIDI + AUDIO RATE

NORDSSYNTH

Nord Stage 2

76 keys · piano · organ · synth

NORDSSYNTH

Nord Lead A1

analog modeling · 4-part

YAMAHAFM

Reface DX

FM 8-op · 37 keys

ROLANDRHYTHM

TR-8

drum machine · USB audio

[ 02 ]INTERFACE + DAW · SIGNAL ROUTING

MOTUDAC

M6 Interface

ESS Sabre32 · 6in/4out · USB-C

NOVATIONCTRL

Launchkey MK3 49

MIDI controller · DAW integration

ABLETONDAW

Live 12 Suite

48k · 128 smp · Max for Live

VCVMODULAR

VCV Rack 2

virtual eurorack · ARM64 native

[ 03 ]BRAIN · COMPUTE + AI ORCHESTRATION

APPLECOMPUTE

M5 Max

128GB unified · 4TB SSD · 614GB/s

OLLAMALLMs

10× modelos locales

Llama 4 Scout · Llama 3.3 70B · Qwen 3.5

CLAUDEAGENT

Claude Code + MCP

Opus 4.7 · Sonnet 4.6 · Haiku 4.5

VOICEBOXTTS

Qwen3-TTS Studio

voice cloning · 23 idiomas

[ 04 ]VISUAL OUTPUT · TOUCHDESIGNER + AI DIRECTOR

DERIVATIVEVISUAL

TouchDesigner

controlado por Claude vía MCP

OLLAMADIRECTOR

qwen3.5 AI Director

ajusta parámetros TD cada 30s

EXISTENTIALPIPE

BlackHole 16ch

Ableton → TouchDesigner audio

SYPHON / NDISHOW

Projector + OBS

live · streaming · Resolume

[ 05 ]FUTURE FLEET · ROADMAP 2026–2027

APPLE · Q3 2026SERVER

Mac Mini M5 Pro

48GB · 24/7 · always-on inference

APPLE · Q4 2026RESEARCH

Mac Studio M5 Ultra

256GB · fine-tune · large models

SYNOLOGY · 2027STORAGE

NAS DS923+ ZFS

backup · audit logs · Restic

TAILSCALEMESH

Private overlay

m5max + minisrv + studio + nas

SECURITY:FileVault · Llama Guard 3 · Tailscale · 1Password (4 vaults) UPTIME:24/7 · daily backup cifrado CARBON:~30W avg · vs ~4kW datacenter equivalente

STACK · LIVE_NETWORK STACK · LIVE_NETWORK

Lo que realmente usamos en producción. What we actually run in production.

No es marketing. Cada operador de esta tabla se ejecuta hoy en mi MacBook Pro M5 Max o en infraestructura cloud de clientes activos. No marketing. Every operator on this table runs today on my M5 Max or on active client cloud infrastructure.

OPS COUNT: 17

LOCAL ● CLOUD ● EDGE ●

LLMs

Llama 4 Scout · Llama 3.3 70B · Qwen 3.5 · DeepSeek V4 Flash · Claude Opus 4.7 · Sonnet 4.6 · Gemini 2.5 Pro/Flash

LOCAL+CLOUD

Agentes

Pydantic AI · LangGraph · Deep Agents · Browser Use · n8n · Claude Code · Vertex AI Agent Builder

LOCAL+CLOUD

Memoria · Embeddings

Mem0 (cross-agent) · Claude-Mem (session) · TEI · ChromaDB · sentence-transformers

LOCAL+CLOUD

RAG · OCR

AnythingLLM · ChromaDB · GLM-OCR (0.9B, #1 benchmark) · LangExtract · Vertex AI Search

LOCAL+CLOUD

Vision · ML

PyTorch MPS · MLX · ResNet-50 · U-Net · YOLO26 · Gemini Vision · Claude Vision · CoreML

LOCAL+CLOUD

Web

HTML · CSS · Vanilla JS · Astro (Q3 2026) · Hono · Cloudflare Pages · Tailwind v4

EDGE

GCP · AI

Gemini 2.5 Pro/Flash · Vertex AI Agent Builder · Vertex AI Studio · Model Garden · AutoML

CLOUD

GCP · Compute

Cloud Run · Cloud Functions Gen2 · GKE · Cloud Build · Artifact Registry

CLOUD

GCP · Data

BigQuery · Dataflow · Pub/Sub · Cloud Storage · Dataform · Looker · Datastream (CDC) · Cloud Scheduler

CLOUD

FinOps · BigQuery

Query optimization · execution plan analysis · partition pruning · semi-join reduction · SLA monitoring · ghost data detection

CLOUD

Azure

Databricks · Azure AI Foundry · Azure OpenAI · Functions · Cosmos DB · Azure DevOps (CI/CD)

CLOUD

Edge

Cloudflare Workers · Workers AI · Vercel Edge · Tailscale · Hono on Edge · Groq

EDGE

AV · Visual

TouchDesigner · TD MCP server · Resolume · Spline 3D · Remotion

LOCAL

AV · Audio

Ableton Live 12 · Max for Live · VCV Rack 2 · MOTU M6 · Nord Stage 2 · Reface DX · TR-8 · BlackHole 16ch

LOCAL

AV · Voice

VoiceBox (Qwen3-TTS, 23 idiomas) · Kokoro 82M · MLX-Whisper v3 · MusicGen 3.3B

LOCAL

Infra · IaC · Security

Terraform · Azure DevOps · Docker · OrbStack · Tailscale · 1Password · FileVault · Llama Guard 3 · CI/CD pipelines

LOCAL+CLOUD

Observabilidad

Langfuse self-hosted · Logfire · Postgres · Redis · Metabase

LOCAL

PUBLICACIONES · LATEST PUBLICATIONS · LATEST

Notas técnicas y reflexiones de campo. Technical notes and field reflections.

Aprendizajes prácticos de despliegues reales. Sin marketing, sin hype. Lo que funciona, lo que no, y qué probar la próxima vez. Practical learnings from real deployments. No marketing, no hype.

[ CASO · CV · TFM ] 2024 · UCM·6 MIN

Por qué la sarcopenia en deportistas pide computer vision local-first. Why sarcopenia in athletes demands local-first computer vision.

Notas sobre el TFM premiado en la Universidad Complutense (2024) realizado en colaboración con Dawako Med Tech. Por qué los datos de ultrasonido no pueden cruzar internet bajo Art. 9.2.h RGPD. Notes on the award-winning thesis at Universidad Complutense (2024), developed with Dawako Med Tech. Why ultrasound data cannot cross internet under GDPR Art. 9.2.h.

Leer artículo Read article Cerrar Close

Este texto comenta la arquitectura y el caso de uso de mi TFM en el Máster en Data Science de la UCM (2024), desarrollado en colaboración con Dawako Med Tech bajo NDA. El dataset y el código no son públicos. Pero el problema clínico, la lógica arquitectónica y las decisiones de despliegue sí pueden compartirse.

Por qué la sarcopenia

La sarcopenia — pérdida progresiva de masa y función muscular — dejó hace tiempo de ser un asunto exclusivo de la geriatría. En medicina deportiva de élite es un marcador temprano de sobreentrenamiento, recuperación incompleta o lesión inminente.

«Las métricas que importan en imagen biomédica no son las del paper. Son las que el clínico acepta cuando ve la salida superpuesta sobre la imagen que conoce.»

Cómo replantearía el despliegue hoy

Tres cosas cambiaría: uno, adiós Colab, hola Apple Silicon nativo (PyTorch MPS sobre Mac Mini M4); dos, audit trail incorporado al producto (Langfuse self-hosted); tres, cuantización int8 para producción, fp32 para reentrenamiento.

Dice score

0.89

AUC holdout

0.93

Latencia M4

<1s

Energía

~30W

This is a commentary on the architecture and use case of my M.Sc. thesis at UCM (2024), developed with Dawako Med Tech under NDA.

Why sarcopenia

Sarcopenia — the progressive loss of muscle mass and function — has long since stopped being a geriatric concern. In elite sports medicine it is an early marker of overtraining, incomplete recovery, or imminent injury.

"The metrics that matter in biomedical imaging are not the ones in the paper. They are the ones the clinician accepts when they see the output overlaid on the image they know."

How I would re-frame deployment today

Three changes: one, goodbye Colab, hello native Apple Silicon (PyTorch MPS on Mac Mini M4); two, audit trail built into the product (Langfuse self-hosted); three, int8 quantisation for production, fp32 for retraining.

Dice score

0.89

Holdout AUC

0.93

M4 latency

<1s

Energy

~30W

[ TUTORIAL · GCP ] 2026.03·6 MIN

RAG en Cloud Run con Gemini: el patrón de dos pasos que evita cold starts. RAG on Cloud Run with Gemini: the two-step pattern that avoids cold starts.

Separar indexing (offline) de serving (lightweight) es el patrón estándar para ML en serverless. Walkthrough completo con AttentionRAG. Separating indexing (offline) from serving (lightweight) is the standard ML serverless pattern.

Leer artículo Read article Cerrar Close

El error más común al desplegar un RAG en Cloud Run no es de modelo: es de arquitectura. La mayoría de los tutoriales empaquetan el indexing y el serving en el mismo container — un patrón que parece razonable hasta que el primer usuario espera 14 segundos.

El problema del cold start

Cloud Run es serverless por diseño: si nadie pide la URL durante 15 minutos, el container se apaga. Si dentro está cargando un modelo de embeddings de 400 MB, el TTFB del primer usuario salta a la luna.

«El cold start no es un problema técnico. Es una mala separación de fases.»

El patrón de dos pasos

La solución arquitectónica es separar las dos fases del RAG en dos servicios distintos: Cloud Run Job para indexing offline; Cloud Run Service con Hono.js para serving lightweight.

Cold start p50

~340ms

TTFT warm

~180ms

Coste/1K queries

$0.30

Min-instances

The most common mistake when deploying RAG on Cloud Run is not a model issue: it's an architectural one.

The cold-start problem

Cloud Run is serverless by design: if no one hits the URL for 15 minutes, the container shuts down.

"Cold start is not a technical problem. It is a phase-separation problem."

The two-step pattern

Split the two RAG phases into distinct services: Cloud Run Job for offline indexing; Cloud Run Service with Hono.js for lightweight serving.

Cold start p50

~340ms

Warm TTFT

~180ms

Cost/1K queries

$0.30

Min-instances

[ REFLEXIÓN ] 2026.02·5 MIN

Local vs cloud no es una guerra ideológica. Es una decisión técnica. Local vs cloud is not an ideological war. It's a technical decision.

Los datos sensibles van en local. La escala global va en cloud. Casi todo lo demás es ego del consultor o miedo del cliente. Sensitive data stays local. Global scale goes to cloud. Almost everything else is consultant ego or client fear.

Leer artículo Read article Cerrar Close

El debate local-first vs cloud-first se ha convertido en una guerra de identidad. Los dos bandos están equivocados — porque confunden una decisión arquitectónica con una posición tribal.

El falso dilema

La pregunta «¿local o cloud?» está mal planteada. La pregunta correcta es: «¿qué constraint del cliente define la arquitectura?». Y los constraints son técnicos, no ideológicos.

«La pregunta correcta no es 'local o cloud'. Es: '¿qué constraint del cliente define la arquitectura?'»

The local-first vs cloud-first debate has turned into an identity war. Both camps are wrong — because both confuse an architectural decision with a tribal stance.

The false dilemma

The question "local or cloud?" is poorly framed. The right question is: "which client constraint defines the architecture?".

"The right question is not 'local or cloud'. It's: 'which client constraint defines the architecture?'"

● 3 POSTS PUBLICADOS · TRIGGER ASTRO ALCANZADO 3 POSTS PUBLISHED · ASTRO TRIGGER REACHED MIGRACIÓN A ASTRO PROGRAMADA · Q3 2026 ASTRO MIGRATION SCHEDULED · Q3 2026

CONTACTO · CONNECT() CONTACT · CONNECT()

Hablemos de tu caso concreto. Let's talk about your specific case.

Una sesión de descubrimiento, sin compromiso, para entender qué problema quieres resolver y si nuestra forma de trabajar encaja con tu organización. A no-commitment discovery session to understand what problem you want to solve and whether our way of working fits your organisation.

// SCHEDULE_DISCOVERY()

30 minutos, formato remoto o presencial en Madrid. Te diré con sinceridad si tu caso encaja con alguno de nuestros servicios o si necesitas otra cosa. 30 minutes, remote or in-person in Madrid. I'll tell you honestly whether your case fits one of our services.

emilianovolpi@digitalrealism.io

● RESPONSE_TIME: 24h UTC+1 · MADRID

Ingeniería de IAresponsable, escalabley medible. AI engineeringresponsible, scalableand measurable.

Siete operadores, una arquitectura coherente. Seven operators, one coherent architecture.

LegalVault.dat

Agents.chop

CloudEngine.top

Hyperflow.comp

Vision.sop

Stage.av

AdvisoryBoard.mat

Cuatro patrones probados en producción. Four patterns proven in production.

Soberanía total. Cero red exterior.Total sovereignty. Zero external network.

Lo sensible local. Lo escalable cloud.Sensitive local. Scalable cloud.

RAG productivo en Cloud Run.Production RAG on Cloud Run.

Inferencia en el edge global.Inference at the global edge.

Pregunta al agente.Lo que ves es lo que el cliente experimenta. Ask the agent.What you see is what the client experiences.

Lo que ya he construido. What I've already built.

Hygeia.cv

Hygeia2.rag

AttentionRAG.cloud

CloudSpec.ai

SolarDetect.cv

SentimentMusic.dat

DeltaEngine.gcp

Ingeniería certificada. Datos bajo control. Certified engineering. Data under control.

Professional Data Engineer

Generative AI Leader

Cloud Digital Leader

Reinvention with Agentic AI

Azure AI Fundamentals

Data Intelligence Fundamentals

OCI Foundations Associate

AI for Medicine

Premio al Mejor TFM · Universidad Complutense de Madrid

Soberanía de datos por diseño. Data Sovereignty & Regulatory Posture.

[ p01 ] DATA RESIDENCY

[ p02 ] AUDIT TRAIL

[ p03 ] AI SAFETY GATE

Una agencia es una persona con un criterio.Empresa unipersonal con acreditaciones corporativas verificables y experiencia enterprise activa. An agency is one person with judgement.A solo studio with verifiable corporate credentials and active enterprise experience.

Principios del Realismo Digital. Principles of Digital Realism.

La arquitectura sigue al campo de sentido, no al ciclo de hype. Architecture follows the field of sense, not the hype cycle.

El dato es una inscripción social inemendable. Data is an unamendable social inscription.

Cada vatio es resistencia material medida. Every watt is measured material resistance.

Trazabilidad contra la algocracia. Traceability against algocracy.

Sistemas que devuelven el saber-hacer. Systems that return know-how.

Des-cancelar el futuro: ecología del tiempo. Un-canceling the future: ecology of time.

Rediseñar el reparto de lo sensible. Redesigning the distribution of the sensible.

Automatización emancipadora y Xenofeminismo. Emancipatory automation and Xenofeminism.

Romper el cinismo: inscribir el "NO" ético. Breaking cynicism: inscribing the ethical "NO".

Ingeniería iterativa y contención de riesgos. Iterative engineering and risk containment.

El estudio que ejecuta todo lo anterior. The studio that runs all of the above.

Lo que realmente usamos en producción. What we actually run in production.

Notas técnicas y reflexiones de campo. Technical notes and field reflections.

Por qué la sarcopenia en deportistas pide computer vision local-first. Why sarcopenia in athletes demands local-first computer vision.

Por qué la sarcopenia

Cómo replantearía el despliegue hoy

Why sarcopenia

How I would re-frame deployment today

RAG en Cloud Run con Gemini: el patrón de dos pasos que evita cold starts. RAG on Cloud Run with Gemini: the two-step pattern that avoids cold starts.

El problema del cold start

El patrón de dos pasos

The cold-start problem

The two-step pattern

Local vs cloud no es una guerra ideológica. Es una decisión técnica. Local vs cloud is not an ideological war. It's a technical decision.

El falso dilema

The false dilemma

Hablemos de tu caso concreto. Let's talk about your specific case.

// SCHEDULE_DISCOVERY()

Ingeniería de IA
responsable, escalable
y medible. AI engineering
responsible, scalable
and measurable.

Pregunta al agente.
Lo que ves es lo que el cliente experimenta. Ask the agent.
What you see is what the client experiences.