gitcrawl/CHANGELOG.md
Peter Steinberger 469d89bc1a
Some checks failed
CI / Go / ${{ matrix.os }} (macos-latest) (push) Has been cancelled
CI / Go / ${{ matrix.os }} (ubuntu-latest) (push) Has been cancelled
Pages / Deploy docs (push) Has been cancelled
Security Gate: Secret Scanning / Scan for Verified Secrets (push) Has been cancelled
chore: prepare gitcrawl 0.3.1
2026-05-08 09:56:02 +01:00

7.2 KiB

Changelog

Unreleased

0.3.1 - 2026-05-08

  • Fix gh-shim portable-store auto-hydration so exact issue/PR refreshes write to the runtime mirror instead of dirtying the Git checkout, clear stale portable refresh locks, and make empty open issue discovery fall through when only targeted sync history exists.
  • Keep cluster-detail aligned with the default cluster list by showing closed historical members unless --hide-closed is passed, and fail fast when GITCRAWL_GH_PATH points back at the gitcrawl shim.

0.3.0 - 2026-05-08

  • Bump routine release workflow dependencies.
  • Add a repo-local gitcrawl agent skill for local archive, freshness, gh-shim, cluster, and verification workflows.
  • Accept full GitHub issue and pull request URLs anywhere gitcrawl expects a thread number, including sync filters, gh-shim views/diffs, governance commands, neighbor lookup, embedding, and TUI jumps.
  • Document read-only SQLite query examples in the repo-local agent skill so agents can do exact local archive counts without mutating state.
  • Document the crawlkit control surface now available on main, including metadata --json, status --json, and doctor --json for local launchers and CI.
  • Clarify that gitcrawl tui remains the reference terminal browser for the crawl app family while shared crawlkit/tui converges on the same panes, sorting, action menus, and status chrome.
  • Add command-reference coverage for the read-only metadata/status commands.
  • Add broader CLI, gh-shim, TUI, and store regression coverage for the verified release surface.

0.2.1 - 2026-05-05

  • Improve gh shim cache coordination and observability with stale-while-revalidate reads, finer Actions/API TTLs, recent-window stats, top miss keys, and xcache snapshot.

0.2.0 - 2026-05-05

  • Add Homebrew tap installation via brew install openclaw/tap/gitcrawl.
  • Improve the gh shim cache with canonicalized keys, targeted mutation invalidation, stale-on-rate-limit fallback reads, completed-run TTLs, hit-rate stats, counter reset, and issue auto-hydration.
  • Add dark-mode support, a theme toggle, and clearer navigation styling to the generated docs site.
  • Force embedding refreshes when the embedding input rune cap changes, so stale larger-cap vectors are not reused.
  • Expand the gh shim with local list filters, PR diff caching by cached head SHA, xcache GC, hit/miss/write counters, and throttled portable-store refreshes to reduce GitHub API pressure across agent sessions.
  • Add explicit PR-detail hydration for files, commits, checks, and workflow runs so gh pr view, gh pr checks, and gh run list/view can answer common review reads from the existing SQLite cache.
  • Auto-hydrate one exact pull request when local PR detail reads miss or check/run data is stale, using gh auth token if GITHUB_TOKEN is absent, then retry from SQLite before falling back to live gh.
  • Cache more ghx-style read-only fallthroughs, including release, workflow, secret, variable, project, ruleset, gist, org, and search reads; cache repeat read failures by default; and clear the fallthrough cache after the corresponding mutating gh commands.
  • Promote portable backups to the v2 format: keep compact comments, PR files, commits, checks, and workflow runs while stripping raw JSON, generated documents, vectors, clusters, and run history.
  • Add crawlkit control metadata/status surfaces with command-local metadata --json, status --json, and doctor --json.
  • Include the primary SQLite database inventory in status JSON so local control surfaces can discover archive storage without opening live stores.
  • Route config path handling and SQLite openers through crawlkit so GitHub archive tooling shares the same foundation as the Slack, Discord, and Notion crawlers.
  • Keep shared crawl app TUI nomenclature aligned while gitcrawl tui remains the richer cluster-browser reference implementation.
  • Keep the existing gitcrawl tui as the family reference terminal interface and add CI smoke coverage for its help surface.

0.1.2 - 2026-05-01

  • Polish the TUI cluster browser interaction model, including separate cluster/member action menus, softer row state colors, stable viewport refresh, bidirectional age sorting, and buffered trackpad scrolling.
  • Add OpenAI embedding retry handling for transient failures and cap oversized embedding inputs before sending them upstream.
  • Improve GitHub pagination and retry behavior by surfacing page totals and honoring retry and rate-limit response headers.
  • Harden human-key hash parsing and tidy the module graph.

0.1.1 - 2026-04-30

  • Fix portable store refreshes when local Git pull configuration tries to rebase onto multiple branch merge refs.
  • Honor GITCRAWL_GITHUB_BASE_URL and GITHUB_BASE_URL during gitcrawl sync, matching cached search and test-server workflows.
  • Fix cached search issues|prs against portable stores by using portable-safe thread body and raw JSON columns.
  • Keep read-only portable-store commands responsive when the backing Git remote is unavailable by making refresh best-effort and non-interactive with bounded SSH connection attempts.

0.1.0 - 2026-04-30

  • Add gitcrawl sync --numbers for exact issue and pull request hydration, including comment documents, without relying on list ordering or updated-time windows.
  • Implement gitcrawl refresh and gitcrawl embed so synced repositories can generate OpenAI embeddings and rebuild durable clusters end to end.
  • Add gitcrawl sync --state open|closed|all so incremental backups can refresh recently closed issues and pull requests.
  • Default gitcrawl sync to --state all, keeping closed issue and pull request state fresh unless a narrower state is requested.
  • Let gitcrawl search fall back to compact thread title/body data when portable stores have pruned generated document indexes.
  • Refresh clean portable-store checkouts before read-only commands so search, threads, clusters, and the TUI see freshly published GitHub backup data automatically.
  • Refresh portable-store status and clear stale SQLite sidecars so doctor and local queries report freshly pulled backup data instead of stale sync metadata.
  • Open writable runtime mirrors for portable-store configs so gitcrawl embed, refresh, and semantic neighbor generation can persist local vectors without mutating the GitHub backup checkout.
  • Show active primary cluster memberships by default in clusters, durable-clusters, and the TUI, with --include-closed reserved for historical audit views.
  • Split generated clusters with bounded nearest-neighbor graph safeguards, GitHub reference evidence, and cross-kind score pruning so weak similarity bridges stop merging unrelated reports into one mega-cluster.
  • Tighten clustering precision by ignoring ambiguous one-digit prose references and requiring weak embedding edges to share concrete title tokens unless they have high similarity or direct GitHub reference evidence.
  • Treat later body-only issue references as weak evidence unless they share title overlap, while still preserving title and lead-body references for canonical issue/PR fix clusters.
  • Hide GitHub-closed members from latest-run cluster summaries and details by default; --include-closed still shows the full historical cluster.
  • Add release plumbing for GitHub release archives via GoReleaser.