Enterprise Model Factory · v2.4 Enterprise Model Factory · v2.4

기업의 AI 모델을, 처음부터 끝까지. Build your company's AI end to end.

모델 선정 · 데이터셋 · 학습 · 평가 · 패키징 · 배포까지 — 한 워크벤치에서 끝납니다. 실행 로그는 다시 학습 데이터가 되어, 쓸수록 똑똑해지는 모델을 만듭니다. Model selection, datasets, training, evaluation, packaging, deployment — finished on one workbench. Execution logs feed the next training round, so the model improves as it runs.

무료로 시작하기Start free

leafcloud.space/factory

LIVE RUN #482

FACTORY PIPELINE · 7 STAGES

04 · ADAPTER TRAIN — ETA 02:14

SelectCompareDatasetTrainEvalPackDeploy

EVENT STREAM

tail — last 5 events

12:47:21dataset.distill · +142 SFT pairs from runtime traces+0.8s

12:47:23train.lora · epoch 3 step 1247 · loss 0.342+2.1s

12:47:25eval.tool_call · pass 94.2% (+1.3%)+1.4s

12:47:26pack.assemble · brain ✓ capability ✓ behavior ✓+0.3s

12:47:28runtime.gateway · listening on :8080+0.6s

Model Factory· Behavior Pattern· Agent Harness· Runtime Loop· 4 Pillars· 7-Stage Pipeline· Intelligence Pack (.leaf)· Capability Learning· Multimodal Stack· Federated Learning· Privacy by Architecture· Runtime Gateway· Local & Cloud Deploy· QLoRA · DPO· Tool-use · MCP · RAG· Skill Orchestration· Self-evolution· Pricing· Model Factory· Behavior Pattern· Agent Harness· Runtime Loop· 4 Pillars· 7-Stage Pipeline· Intelligence Pack (.leaf)· Capability Learning· Multimodal Stack· Federated Learning· Privacy by Architecture· Runtime Gateway· Local & Cloud Deploy· QLoRA · DPO· Tool-use · MCP · RAG· Skill Orchestration· Self-evolution· Pricing·

V1 통합

Architecture · 4 Pillars

네 개의 축이 모델을 스스로 진화시킵니다 Four pillars. One self-improving model.

생산 · 데이터 · 실행 · 진화 — 네 개의 축이 하나의 루프로 이어집니다. Production · Behavior · Execution · Evolution. Four axes locked into a single loop.

PRODUCTION

모델 팩토리Model Factory

선정 → 데이터 → LoRA → 양자화 → 패키징. 회사 목적에 맞는 모델을 직접 생산합니다. Select → dataset → LoRA → quantize → package. Mint a model that fits your goal.

BEHAVIOR

행동 패턴Behavior Pattern

언제 스킬·툴·MCP를 호출할지, 실패 시 어떻게 복구할지를 데이터로 학습합니다. When to call skills, tools, MCP — and how to recover from failure.

EXECUTION

에이전트 하네스Agent Harness

스킬 · 툴 · MCP · 루프 7개 모듈이 결합해 모델을 자율 에이전트로 만듭니다. Seven modules — skills, tools, MCP, loop — turn the model into an autonomous agent.

EVOLUTION

런타임 러닝Runtime Learning

실행 로그가 다시 SFT/DPO 학습 데이터가 되어, 쓸수록 모델이 강해집니다. Execution logs become SFT/DPO data. The more it runs, the stronger it gets.

The Problem

빌려 쓰는 지능은 오래가지 못합니다 Borrowed intelligence won't last

외부 API에만 의존하는 AI 도입이 두 번째 해를 넘기지 못하는 이유. Why API-dependent AI rarely survives the second year.

비용 폭증Runaway cost

사용량이 늘수록 API 비용이 선형 이상으로 커집니다. API cost scales super-linearly with usage.

데이터 유출 위험Data egress risk

문서 · 코드 · 고객 데이터가 외부로 나가는 순간 통제력이 사라집니다. Once docs, code, and customer data leave your VPC, control is gone.

도메인 무지No domain memory

범용 모델은 우리 회사 문서 · 프로세스 · 도구를 모릅니다. General models don't know your docs, process, or tools.

학습 정체No feedback loop

실패 데이터를 학습하지 않으면 같은 실수를 반복합니다. Without ingesting failures, the model repeats them.

Pipeline · 7 stages

데이터에서 배포까지, 한 화면에서 From data to deploy, on one canvas

7단계가 자동화되어 있고, 각 단계는 독립 실행도 됩니다. Seven automated stages — each runnable on its own.

선정Select

비교Compare

데이터셋Dataset

학습Train

평가Evaluate

패키징Pack

배포Deploy

모델 선정Model Selection

Qwen · Gemma · DeepSeek · MiniMax · 이미지/영상/음성 모델 후보 등록Register Qwen · Gemma · DeepSeek · MiniMax · image/video/voice candidates

벤치 비교Bench Compare

속도 · 한국어 · 코딩 · Tool-use · RAG 정확도 · 비용을 같은 조건에서 측정Latency · Korean · coding · tool-use · RAG · cost — under one rubric

데이터셋 합성Dataset Synthesis

문서 · 대화 · 코드 · 실행 로그 → SFT · DPO · Tool-use · MCP-use 데이터 변환Docs · dialogs · code · logs → SFT · DPO · tool-use training data

어댑터 학습Adapter Training

LoRA · QLoRA · DPO · Tool-use 학습으로 회사 목적에 맞는 어댑터 생성LoRA · QLoRA · DPO · tool-use to mint goal-specific adapters

평가 · 회귀 검증Eval & Regression

학습 전후 성능, Tool-call 성공률, RAG 근거 정확도, 코드 테스트 통과율Pre/post deltas across tool-call, RAG citation, code-test pass rate

Intelligence Pack 패키징Pack & Sign

모델 · 어댑터 · 프롬프트 · 툴 · MCP · 가드레일을 하나의 .leaf 파일로Model + adapter + prompt + tools + MCP + guardrails into one .leaf file

어디서나 배포Deploy Anywhere

Mac은 MLX/Ollama, 클라우드는 vLLM/LoRAX. 같은 패키지, 같은 호출.MLX/Ollama on Mac. vLLM/LoRAX in cloud. Same package, same call.

Intelligence Pack

단순한 모델 파일이 아닙니다.
실행 가능한 지능 패키지입니다. Not a model file.
A shippable intelligence package.

모델만이 아니라 "어떻게 일하는지"까지 한 파일에 패키징됩니다. More than the brain — the operating manual ships in the same file.

.leaf

enterprise-agent.leaf

v2.4.1 · 14 components · 3.2 GB · signed

두뇌Brain

qwen3-32b.gguf lora-adapter.safetensors tokenizer.json

L1 · CORE

능력Capability

skills.yaml tools.schema.json mcp.config.toml harness.policy

L2 · CAPABILITY

행동 규칙Behavior

system.prompt rag.config runtime.config

L3 · BEHAVIOR

가드레일Guardrails

guardrails.yaml eval.policy

L4 · SAFETY

배포 매니페스트Deploy

manifest.toml eval.report.md

L5 · DEPLOY

TOOL-CALL

94.2%

RAG ACC.

89.7%

LATENCY

142ms

VRAM

18GB

# single command, anywhere

$ leaf run enterprise-agent.leaf

# or via OpenAI-compatible endpoint

curl localhost:8080/v1/chat/completions \

-d '{"model":"enterprise-agent","messages":[...]}'

하나의 .leaf 파일에 14개 컴포넌트가 들어 있습니다. Mac에서도, 클라우드에서도 같은 명령으로 실행됩니다. A single .leaf file holds 14 components. Same command on Mac or cloud.

Capability-aware Learning

답을 외우는 게 아니라,
일하는 방법을 학습합니다 Not memorizing answers.
Learning how to work.

스킬 선택 · 툴 호출 · MCP 사용 · 실패 복구 루프까지 — 8가지 학습 신호. Skill selection · tool invocation · MCP usage · failure recovery — eight signals.

SFT

정답 답변과 업무 포맷 학습 Reference answers, output schemas

DPO

좋은 행동 · 나쁜 행동의 선호 학습 Preference between good and bad actions

Tool-use

파일 · 터미널 · 검색 · 테스트 호출 File · shell · search · test invocation

MCP-use

GitHub · DB · Browser · FS 연결 GitHub · DB · browser · filesystem links

Skill-use

반복 작업의 스킬 선택 Picking the right skill for the job

Orchestration

Planner · Coder · Tester 분담 Planner · coder · tester role split

Harness Loop

분석 → 실행 → 평가 → 재시도 Analyze → act → evaluate → retry

Self-evolution

실패 분석 → 정책 수정 → 개선 Failure → policy update → improve

Composable Stacks

하나의 LLM이 아니라,
목적별 모델 조합 Not one LLM.
A composable stack per purpose.

LLM · 코딩 · 이미지 · 영상 · 음성 모델을 조합해 하나의 실행 패키지로. LLM · coding · image · video · voice — composed into a single runtime.

FEATURED

Enterprise Agent Enterprise Agent

기업 내부 문서 · 시스템과 연결되는 AI 에이전트. 가장 일반적인 구성으로, 툴 · RAG · MCP가 결합된 Reference Stack입니다. An agent connected to internal docs and systems. The reference stack — tools + RAG + MCP.

LLM · qwen3-32b

RAG · bge-m3

Tools · 14 schemas

MCP · github · db · fs

Runtime · vLLM

Guardrails · ON

코딩 에이전트Coding Agent

Sandbox · GitHub MCP · 테스트 루프 Sandbox · GitHub MCP · test loop

deepseek-coder + sandbox

크리에이티브Creative

LLM + 이미지 + 영상 + 음성 LLM + image + video + voice

flux + ltx + bark

로컬 프라이빗Local Private

데이터가 외부로 나가지 않음 Data never leaves the device

mlx + ollama + gguf

클라우드 스케일Cloud Scale

대규모 팀과 API 서비스 For teams and public APIs

vllm + lorax + multi-tenant

Runtime

노트북에서 클라우드까지,
같은 패키지로 실행됩니다 From laptop to cloud,
one package, one command

Mac M3 노트북에서도, 클라우드 GPU에서도 같은 호출 시그니처로 동작합니다. Identical call signature on an M3 laptop or a cloud GPU pool.

LOCAL

Mac · Apple Silicon

MLX Ollama GGUF

저비용 PoC 테스트Low-cost PoC iterations
보안이 중요한 환경Air-gapped, sensitive workloads
개인 개발 · 실험Solo dev exploration
완전 오프라인 실행Fully offline runtime

GATEWAY

Leaf Runtime Gateway

같은 패키지, 같은 호출 시그니처. 환경만 다릅니다. Same package, same call signature. Only the host differs.

# works in both

leaf.runtime.call(model, messages)

openai-compatible streaming batch

CLOUD

Cloud GPU Pool

vLLM LoRAX Triton

팀 단위 사용Team and org workloads
고속 추론 (멀티 LoRA)High-throughput multi-LoRA
기업용 API 서비스Public-facing API services
사용량 기반 과금Usage-metered billing

Runtime Learning

실행 경험이 곧 학습 데이터입니다 Experience becomes training

검색 · 수정 · 테스트 · 복구하는 모든 과정이 다음 모델의 학습 셋이 됩니다. Every search, edit, test, and recovery step feeds the next training set.

실행Run

호출 기록Trace

평가Score

데이터 변환Distill

재학습Retrain

새 PackRe-deploy

+12.4%

QoQ TOOL-CALL ACC.

−38%

FAILURE REPETITION

×2.1

DATASET GROWTH / MO

V1 통합

Federated · 4-Layer Privacy

데이터는 떠나지 않습니다.
지능만 옮겨갑니다. Data stays.
Only intelligence travels.

기업 데이터를 외부로 보내지 않고 모델을 학습합니다. 모델 업데이트(Δ)만 전송됩니다. Train without exposing data — only model deltas leave the boundary.

CLIENTS

은행 ABank A

12k samples · local

병원 BHospital B

8k samples · local

공장 CFactory C

22k samples · local

ENCRYPTED Δ

Local-only

data never leaves

DP Noise

ε-differential

Sec-Agg

encrypted sum

mTLS

transport layer

AGGREGATED Δ

FEDAVG SERVER

글로벌 모델Global Model

ROUND 17 · ACC 91.2%

GDPR HIPAA 금융권 규제Finance regulations On-premise SOC 2 ready

Privacy by Architecture

데이터가 나갈 길을
애초에 만들지 않습니다 No path out.
No leak.

정책 문서가 아니라 시스템 구조로 막습니다.
통제는 코드 레벨에서 이뤄집니다. Not enforced by policy docs — enforced by the architecture itself.
Control lives in code.

민감정보 자동 마스킹Sensitive Data Masking

API Key · 개인정보 · 내부 URL · 계약서를 자동으로 식별·제거Auto-redact API keys, PII, internal URLs, contract terms

Local-first 학습Local-first Training

원본 데이터는 고객 환경에 머무릅니다. 모델 가중치만 이동합니다.Source data stays in your VPC. Only weights move.

어댑터 중심 업데이트Adapter-centric Updates

전체 데이터가 아닌 학습 결과 · 메타데이터 중심으로 관리합니다.Manage by deltas and metadata, not full datasets.

감사 로그Full Audit Trail

누가, 어떤 모델을, 언제 학습 · 배포 · 호출했는지 모두 추적합니다.Who trained / deployed / invoked which model — all logged.

실행 권한 정책Execution Policies

툴 권한 · MCP 접근 · 샌드박스 실행 범위까지 코드로 통제합니다.Tool permissions, MCP scope, sandbox boundaries — all enforced in code.

Pricing

Quick에서 시작, Enterprise로 확장 Start in Quick. Scale to Enterprise.

Quick

PoC · 데모용PoC and demos

Free

모델 비교 워크벤치Model comparison workbench
데이터셋 생성 (제한)Limited dataset generation
로컬 실행 테스트Local-only runtime
커뮤니티 지원Community support

시작하기Start free

Pro

개인 · 소규모팀Solo / small team

$99 /mo

Quick의 모든 기능All of Quick
LoRA · QLoRA 학습LoRA · QLoRA training
GGUF 내보내기GGUF export
Ollama · MLX 배포Ollama · MLX deploy

시작하기Start Pro

Business

팀 단위 클라우드Cloud for teams

$499 /mo

Pro의 모든 기능All of Pro
vLLM · LoRAX 배포vLLM · LoRAX deploy
팀 사용 (10명)Up to 10 seats
API GatewayAPI Gateway
우선 지원Priority support

시작하기Start Business

Enterprise

전사 도입 · 규제 산업Org-wide / regulated

Custom

Business의 모든 기능All of Business
전용 VPC · 온프레미스VPC / on-premise
연합학습 지원Federated learning
전담 엔지니어Dedicated engineer
SLA 보장SLA

상담하기Contact sales

우리 회사의 데이터는
이미 모델이 될 준비가 됐습니다 Your data is
already a model in waiting

2분만에 첫 모델 패키지를 만들어보세요. Mint your first intelligence pack in two minutes.

Quick으로 시작 · 무료Start Quick — free Enterprise 상담Talk to sales