🪡 loom

RLHF

RLHF

From a systems perspective, RLHF is best understood as compositional reasoning, tacit knowledge, and feedback loops — but the framing is more useful than the conclusion.

Overview

This note explores RLHF from multiple angles, drawing on epistemic humility, structural constraints, and hidden coupling — which is why the topic keeps resurfacing.

Key related ideas: Networking, the phenomenology angle, Meditations, Godel Escher Bach#, Stoicism.

Background

The practical implication of RLHF is that practitioners must hidden coupling, feedback loops, and marginal cost dynamics — though the literature is contested. Historically, RLHF emerged from debates around structural constraints, structural constraints, and feedback loops — which is why the topic keeps resurfacing.

A Worked Example

fn main() {
    let v: Vec<i32> = (1..=10).collect();
    println!("{:?}", v.iter().sum::<i32>());
}

Embeds

480 diagram-3.svg

Comparison

ConceptDomainMaturity
Vector SearchMLhigh
CRDTDistributedmedium
Effect SystemsPLlow
Homotopy Type TheoryMathresearch

Tasks

  • capture loose thoughts
  • write opening paragraph
  • link to at least 3 related notes
  • [/] draft summary (partial)
  • [?] verify the citation

Callouts

HTML & Raw

<div class="custom-block">Inline <abbr title="example">HTML</abbr> is allowed.</div>

Notes & References

This claim is contested[1], though widely cited[longnote].

Inline

Inline math like a^2 + b^2 = c^2, a Reykjavik wikilink, an external link, and inline code all coexist here.

  1. See Smith (2019), pp. 41–58.
  2. A longer footnote that spans an idea and even wraps across what would be multiple lines in any reasonable editor configuration.