training language models to follow instructions with human feedback — search2

A chronological history of training language models to follow instructions with human feedback, covering key events, milestones, sentiment shifts, first appearances, and patent filings, each with full provenance.

Data Sources

Results are synthesized from web crawls, academic citations (Semantic Scholar), patent filings (USPTO), corporate ownership records (SEC EDGAR, Companies House), software dependency graphs (npm, PyPI, crates.io), infrastructure records (DNS, TLS certificates, IP ranges), temporal archives (Wayback Machine), government datasets (Data.gov), and conversational sources (Hacker News, Reddit).

View Modes