Open Feed Recsys Lab tutorial

How to verify, audit, and pretest X algorithm claims locally.

Use this tutorial to turn xai-org/x-algorithm into a reproducible local report, audit viral algorithm claims against public source, and compare draft posts with bounded synthetic audience research.

The problem

The open X For You algorithm repository has two different surfaces: source code you can inspect immediately, and Phoenix model artifacts that require a large Git LFS archive before inference can run.

That difference matters. If you only clone the repo, you can map the architecture and inspect the source. You cannot honestly claim you ran Phoenix unless the artifacts are downloaded and extracted.

What the lab produces

open-feed-recsys-lab generates four reproducibility artifacts:

The companion claim auditor generates source-backed verdicts for viral claims. The TinyTroupe-inspired feed research skill compares draft posts with synthetic personas. Those outputs are useful for public explanation and drafting, but they are not live X ranking predictions.

Install

git clone https://github.com/zack-dev-cm/open-feed-recsys-lab
cd open-feed-recsys-lab

Run the report generator:

python3 skill/open-feed-recsys-lab/scripts/open_feed_recsys_lab.py

By default, the script clones xai-org/x-algorithm with Git LFS smudge disabled. That keeps the first report lightweight and avoids pulling multi-GB artifacts unexpectedly.

Use an existing checkout

python3 skill/open-feed-recsys-lab/scripts/open_feed_recsys_lab.py \
  --repo-dir /path/to/x-algorithm \
  --output-dir /tmp/open-feed-recsys-lab-report

Pin a specific commit or ref:

python3 skill/open-feed-recsys-lab/scripts/open_feed_recsys_lab.py \
  --ref e414c17 \
  --output-dir /tmp/open-feed-recsys-lab-report

Read the result

Open run_report.md first. The report tells you which commit was inspected, whether the working tree was clean, whether key components are present, whether Phoenix artifacts are complete, and whether documentation and extracted configs agree.

Then open artifact_check.md. For a lightweight clone, expect the Phoenix zip to be a Git LFS pointer. That means source inspection is valid, but Phoenix execution is not ready yet.

Finally, open architecture_map.html. This is the shareable artifact: it maps the request flow across Home Mixer, Thunder, Phoenix, ads, prompts, who-to-follow, hydration, filtering, scoring, and the ranked feed response.

Run Phoenix only when artifacts exist

python3 skill/open-feed-recsys-lab/scripts/open_feed_recsys_lab.py \
  --repo-dir /path/to/x-algorithm \
  --artifacts-dir /path/to/oss-phoenix-artifacts \
  --run-phoenix

The expected extracted files include retrieval and ranker configs, model params, embedding tables, the sports corpus, and the example sequence.

Audit a viral claim

Use the claim auditor when a post says something like "replies are worth 150x a like" or "Premium gets a visibility boost":

python3 skill/x-algo-claim-auditor/scripts/x_algo_claim_auditor.py \
  --repo-dir /path/to/x-algorithm \
  --claim "Replies are worth 150x a like in the new X algorithm" \
  --output-dir /tmp/x-algo-claim-audit

The output is a verdict, evidence ledger, and share card. It should say misleading when the public source supports engagement probabilities or weights in general but not the exact viral multiplier.

Pretest draft posts

Use the TinyTroupe-inspired research skill when you have multiple drafts and want a bounded synthetic audience read:

python3 skill/tinytroupe-feed-research-lab/scripts/tinytroupe_feed_research_lab.py \
  --audience "AI builders and creator-operators interested in source-backed algorithm research" \
  --draft "I audited a viral X algorithm claim against public source. Verdict: misleading. What should I check next?" \
  --draft "Replies are king. Here is what the public repo actually proves." \
  --output-dir /tmp/tinytroupe-feed-research

Read feed_research_report.md for the best synthetic conversation draft, objections, and rewrite suggestions.

Boundary: This is synthetic audience research. It does not predict reach, rank, virality, revenue, or live X production behavior.

What this does not prove

This does not prove that the public repo fully matches live production behavior. It also does not predict reach, virality, creator growth, or account performance.

It proves something narrower and more useful: what source was inspected, what artifacts are present, whether the local Phoenix run is ready, where public docs and configs agree or disagree, whether a viral claim is supported by public files, and how synthetic personas react to a draft under explicit boundaries.

Share prompt

Use this only after generating your own report:

I verified the open X For You algorithm repo locally at commit <sha>. Here is the report: source inspected, Phoenix artifact readiness checked, and architecture mapped.

Install the skills from ClawHub

clawhub install open-feed-recsys-lab
clawhub install x-algo-claim-auditor
clawhub install tinytroupe-feed-research-lab