Skip to main content

Model, Inputs, Examples & Policy

Transparent scoring (0–100) with weights, decay, examples, and actions.

C
Written by Catalin Fetean
Updated over 3 weeks ago

Audience: Everyone; especially Admins, Procurement, PMs, Compliance
Outcomes: Predict score changes; apply fair, predictable controls

What reputation is

Per-org 0–100 score used for routing, limits, and partner selection. It’s a risk/quality signal with transparent inputs and a path to improvement (not public shaming).

Inputs (signals)

  • On-time milestone rate

  • Dispute rate (size-weighted)

  • Dispute outcomes: refund/release/custom

  • Appeal quality: overturned rate

  • Refund ratio; chargeback ratio (heavier weight)

  • Evidence quality (structured/verifiable)

  • Recency: newer events count more (decay older)

  • Volume baseline: Bayesian prior for small orgs

Math (tunable)

Base prior = 75. Each event applies a delta scaled by weight and recency:

score_t = clamp(0, 100, prior + Σ(w_signal * recency_decay(days_since_event) * normalized_signal) ) recency_decay(d) = 0.5^(d / 90) # half-life 90 days

Suggested weights (example):

  • On-time >80%: +0.20 per %

  • Dispute rate >2%: −0.50 per %

  • Refund ratio: −0.60 per %

  • Chargeback ratio: −1.20 per %

  • Outcome: refund_full −8.0; refund_partial −4.0; release_to_seller +2.0; custom done on time +1.0; appeal overturned −3.0; good evidence +0.5

Bayesian stabilization (avoid whiplash when n<20):

stabilized = (prior * k + raw_score * n) / (k + n) # k=20

Exact dispute → reputation mappings

  • Opening a dispute: no change

  • At resolution:

    • refund_full −8 (decay 90d)

    • refund_partial −4 (decay 90d)

    • release_to_seller +2 (decay 90d)

    • custom 0 initially; +1 if completed on time (−1 if missed)

    • Evidence quality bonus +0.5 (structured/verifiable)

    • Appeal overturned: −3 to the side that lost

  • Diminishing returns cap per week to avoid runaway effects

Examples (intuition)

  • A: 20 orders, 95% on-time, 1 partial refund → ~77–80 (on-time offsets partial)

  • B: 5 orders, 80% on-time, 1 full refund last week → ~70–73 (prior tempers impact)

  • C: 50 orders, 3% chargebacks last quarter~60–65 until ratio improves

Policy thresholds (example; show in Admin → Reputation)

  • 85+ Trusted: fast-track reviews, lower friction

  • 70–85 Normal: standard flows

  • 55–70 Watchlist: deposit required on new orders; nudge KYC re-check

  • <55 Restricted: pre-review new contracts; higher deposit %; Neutral Admin auto-assigned; possible temporary freeze after repeated incidents

QA checklist

  • Reputation log shows each delta with timestamp, cause, weight, decay

  • Scores decayed nightly; no double-apply on day boundary

Runbook: “Score drop surprise”

  • Open Reputation log → review last deltas → check decay → simulate with weights; adjust policy or communicate remediation steps.

Did this answer your question?