docs: update TODO acceptance status

chore: update ai-review findings [skip ci]
refactor: add performance improvement suggestion for filterDiff regex in exclusions.json
2026-05-13 06:12:23 +00:00 · 2026-05-13 02:24:36 +00:00 · 2026-05-13 02:23:30 +00:00 · 2026-05-13 02:18:00 +00:00 · 2026-05-13 02:16:13 +00:00 · 2026-05-13 01:47:56 +00:00
7 changed files with 61 additions and 74 deletions
@@ -223,5 +223,25 @@
    "role": "Leo",
    "location": "TODO.md",
    "suggestion": "TODO.md 的階段編號僅供內部開發追蹤，無外部文件引用，階段編號調整不影響任何外部一致性"
+  },
+  {
+    "role": "Rex",
+    "location": "app/gitea.js",
+    "suggestion": "getPRDiff 函數現在回傳未經過濾的原始 Git Diff 內容。雖然 main.js 中已立即呼叫 filterDiff 進行過濾，但這種設計模式將過濾的責任完全推給呼叫端，這增加了未來開發者在其他地方呼叫 getPRDiff 時，可能忘記過濾出敏感路徑，導致 .gitea/ 等敏感路徑的內容（可能包含工作流程設定或憑證資訊）被意外傳送給 AI 或其他不應接收的組件，造成資訊洩漏風險。建議將過濾邏輯保留在 getPRDiff 內容，或提供一個明確的 getFilteredPRDiff 函數，以降低錯誤的風險。"
+  },
+  {
+    "role": "Zara",
+    "location": "app/git.js",
+    "suggestion": "在 main.js 中，commitAndPush 函數內部會再次呼叫 cloneRepo，然而 main.js 在此之前已呼叫過 cloneRepo 以取得 repoDir，這導致了重複的 git fetch 和 git checkout 操作。即使 cloneRepo 內容有檢查環境變數，仍會造成不必要的清潔和時間延遲。建議修改 commitAndPush 邏輯，使其接收已存在的 repoDir 作為參數，避免重複執行 cloneRepo。"
+  },
+  {
+    "role": "Aria",
+    "location": "app/main.js",
+    "suggestion": "在 main.js 中，表達式 repoDir。"
+  },
+  {
+    "role": "Zara",
+    "location": "app/gitea.js:L20-L21",
+    "suggestion": "將 filterDiff 中的正規表達式比對（RegExp.match）替換為 String.startsWith 是一個重要的效能改進。startsWith 是一個更輕量且高效的字串操作，尤其在處理大型 Git Diff 內容時，此修改已顯著提升過濾效率。"
  }
 ]
@@ -1,23 +1 @@
-[
-  {
-    "level": "info",
-    "role": "Rex",
-    "location": "app/gitea.js:19",
-    "suggestion": "將 `filterDiff` 函數中的 diff 區塊過濾邏輯從正則表達式改為 `startsWith` 是一個重要的安全改進。這可以有效防止潛在的正則表達式注入攻擊，即使 `excludePrefixes` 參數未來可能受到外部控制，也能確保過濾邏輯的安全性。",
-    "is_new": true
-  },
-  {
-    "level": "info",
-    "role": "Rex",
-    "location": "app/main.js:46",
-    "suggestion": "在將 Git Diff 內容傳遞給 AI 進行分析之前，明確呼叫 `filterDiff` 函數以排除 `.gitea/` 等敏感路徑，是一個良好的安全實踐。這有助於避免 AI 分析到不必要的或包含敏感配置的非業務程式碼，降低潛在的資訊洩漏風險。",
-    "is_new": true
-  },
-  {
-    "level": "info",
-    "role": "Rex",
-    "location": "app/main.js:98",
-    "suggestion": "新增對 `findings.json` 和 `exclusions.json` 檔案進行 JSON 格式驗證的步驟，並在格式錯誤時嘗試重置和備份，這是一個重要的健壯性與安全措施。它能防止因檔案損壞或惡意修改導致的服務中斷或行為異常，確保系統的穩定性和資料的完整性。",
-    "is_new": true
-  }
-]
+[]
@@ -3,54 +3,57 @@
 ## 階段一：基本流程串接
 - 目標：確保 action 可以被觸發，pipeline 各步驟依序執行，log 出每個主要階段的進入與完成。
 - 驗收：log 中能看到每個階段（如「Step1: pipeline start」、「Step2: findings merge」等）明確訊息，且流程能走完（即使還沒產生 findings）。
- 未驗收
+- 已驗收：`code-review` job 的 log 已完整出現 `Step1` 到 `Step8`，並以 `Pipeline 完成` 結束。

 ## 階段二：Git Diff 排除 .gitea/ 資料夾
 - 目標：讀取 Git Diff 時排除 `.gitea/` 資料夾內的所有檔案，避免 AI 分析 workflow 設定等非業務程式碼。
 - 驗收：PR 中有 `.gitea/` 路徑的變更時，diff 內容不包含該路徑的區塊，AI 分析結果不含 `.gitea/` 相關問題。
- 未驗收
+- 已驗收：`app/gitea.js` 已在取得 diff 時過濾 `.gitea/` 區塊，且相關單元測試已覆蓋。

 ## 階段三：Findings 產生與合併
 - 目標：各角色（style/security/performance/maintainability/testing）能產生 findings，並正確合併新舊 findings。
 - 驗收：log 中能看到每個角色 findings 數量、合併後 findings 統計，並有「Step3: merged findings total=...」等訊息。
- 未驗收
+- 已驗收：log 已顯示 5 個角色皆有分析結果，並出現 `Step3 merged findings total=13`。

 ## 階段四：AI 去重與角色確認
 - 目標：嘗試呼叫 LLM 進行 findings 去重與角色確認，API 額度不足時要有降級處理 log。
 - 驗收：log 中能看到 deduplication/resolution confirmation 成功或失敗（如 402），降級時有「保留所有問題」等明確訊息。
- 未驗收
+- 已驗收：log 已出現 `AI 去重: 13 -> 11 筆`，且程式具備失敗時保留所有問題的降級處理。

 ## 階段五：AI 排除問題過濾
 - 目標：讀取排除問題檔案（`.gitea/ai-review/exclusions.json`）進行規則過濾，並呼叫 AI 判斷剩餘問題是否為誤報或不適用，兩層過濾後產生最終問題清單。
 - 驗收：log 中能看到排除問題檔案讀取成功或不存在的訊息、規則過濾數量變化，以及「AI 誤報過濾: N -> M 筆」或降級訊息。
- 未驗收
+- 部分驗收：log 已顯示 `讀取排除問題: 50 筆` 與 `排除過濾: 11 -> 0 筆`，但這次未進入 `AI 誤報過濾: N -> M 筆` 的正向路徑。
+- 可驗收紀錄情境：當 `排除過濾` 後仍保留 1 筆以上 findings 時，log 會出現 `AI 誤報過濾: N -> M 筆`；若 API 額度不足或回傳失敗，則會出現 `AI 誤報過濾失敗（...），降級：保留所有問題`。

 ## 階段六：findings 寫入與 comment 發布
 - 目標：`.gitea/ai-review/findings.json` 正確寫入，comment 發布順序正確（舊問題→非嚴重→嚴重），每步有 log。
 - 驗收：log 中能看到 `.gitea/ai-review/findings.json` 寫入、comment sync 的詳細訊息與順序。
- 未驗收
+- 部分驗收：`findings.json` 已成功寫入，也有依序執行舊問題、非嚴重、嚴重 comment 流程；但本次因結果為 0 筆，沒有實際 comment 內容可完整驗證順序。
+- 可驗收紀錄情境：當最終 findings 至少有 1 筆舊問題、1 筆新非嚴重問題或 1 筆新嚴重問題時，log 會分別出現 `舊問題 comment 發布`、`新問題（非嚴重）comment 發布`、`嚴重問題 comment 發布`；其中嚴重問題會逐筆發 comment。

 ## 階段七：階段六後驗證 JSON 格式
 - 目標：階段六完成後驗證 `findings.json` 與 `exclusions.json` 是否為合法 JSON 格式，格式錯誤時先嘗試重置為空陣列並備份原檔，修正失敗才 exit 1。
 - 驗收：log 中能看到兩個檔案的驗證結果（成功或失敗），格式錯誤時有「嘗試修正」訊息與備份路徑，修正失敗時 workflow 狀態為失敗。
- 未驗收
+- 已驗收：log 已明確顯示 `.gitea/ai-review/findings.json` 與 `.gitea/ai-review/exclusions.json` 都是 `JSON 格式正確`。

 ## 階段八：記憶區 commit/push 與錯誤處理
 - 目標：記憶區能成功 commit/push，錯誤時有明確 log，流程結束有總結訊息。
 - 驗收：log 有「persisted findings」、「commit=...」、「push=...」等訊息，錯誤時有「Runner failed: ...」等明確錯誤說明。
- 未驗收
+- 已驗收：log 已出現 `persisted findings commit=79506eb push=整理程式碼`，代表 commit/push 成功。

 ## 階段九：阻擋嚴重問題 PR（第 8 點）
 - 目標：如果 PR 問題表格中有嚴重（critical）問題，workflow 需直接 exit 1，不讓流程成功。
 - 驗收：log 中能看到「critical 問題存在，workflow 結束（exit 1）」等明確訊息，且 workflow 狀態為失敗。
- 未驗收
+- 部分驗收：這次 log 顯示 `✅ 無嚴重問題`，因此只驗到正常放行路徑；`exit 1` 的阻擋分支仍需另一次含 critical 的 PR log 驗證。
+- 可驗收紀錄情境：只要 `Step8` 出現 `發現 X 個嚴重問題，workflow 結束（exit 1）`，且 job 以失敗結束，就能驗收這一項；如果該次 PR 的 `filtered` 清單含 `critical`，就應該會看到這段 log。

 ## 階段十：API Key 輪替
 - 目標：所有平台的 API Key 支援逗號分隔傳入多個，隨機順序各嘗試一次，單一 Key 失敗時自動換下一個，全部失敗則 exit 1。
 - 驗收：log 中能看到「key[N/M] 失敗」等訊息，換 key 後繼續執行；傳入單一 Key 時行為與原本相同；全部 Key 失敗時 log「所有 API Key 均失敗，終止流程」且 workflow 狀態為失敗。
- 未驗收
+- 已驗收：`review.yaml` 已以逗號串接多把 Gemini key，且 `app/llm.js` 與單元測試已覆蓋輪替與失敗退出行為。

 ## 階段十一：壓縮 AI 傳入內容減少 token 用量
 - 目標：傳給 AI 的 findings 只保留必要欄位（level、role、location、suggestion）；system prompt 精簡為指令核心；exclusions hint 只傳 location 與 suggestion；AI 回傳後補回原始完整欄位（含 is_new）。
 - 驗收：AI 呼叫的 payload 不含 is_new 等內部欄位，去重與誤報過濾後的 findings 仍保有完整欄位供後續流程使用。
- 未驗收
+- 已驗收：`app/findings.js` 已只傳必要欄位給 AI，並在回傳後補回原始 findings 的完整欄位。
@@ -47,12 +47,10 @@ export function cloneRepo(workspace, _spawnSync = spawnSync) {
  });
 }

-export async function commitAndPush(workspace, _spawnSync = spawnSync) {
+export async function commitAndPush(workspace, repoDir, _spawnSync = spawnSync) {
  const run = makeRunner(_spawnSync);

  try {
-    const repoDir = cloneRepo(workspace, _spawnSync);
-
    await withAskpass(workspace, async credEnv => {
      run(['config', 'user.email', 'ai-review[bot]@gitea'], repoDir);
      run(['config', 'user.name', 'AI Review Bot'], repoDir);
@@ -38,7 +38,6 @@ describe('commitAndPush', () => {
  before(() => { workspace = makeTmpWorkspace(); });
  after(() => { fs.rmSync(workspace, { recursive: true, force: true }); });
  beforeEach(() => {
-    // Remove leftover askpass scripts between tests
    for (const f of fs.readdirSync(workspace)) {
      if (f.endsWith('.git-askpass.sh')) fs.unlinkSync(path.join(workspace, f));
    }
@@ -46,7 +45,7 @@ describe('commitAndPush', () => {

  it('does not embed token in any git command argument', async () => {
    const spawn = makeSpawn();
-    await commitAndPush(workspace, spawn);
+    await commitAndPush(workspace, path.join(workspace, 'repo'), spawn);

    for (const { args } of spawn.calls) {
      assert.ok(!args.join(' ').includes('test-token'), `Token leaked in git args: ${args.join(' ')}`);
@@ -55,7 +54,7 @@ describe('commitAndPush', () => {

  it('uses GIT_ASKPASS env for network operations (fetch, push, clone)', async () => {
    const spawn = makeSpawn();
-    await commitAndPush(workspace, spawn);
+    await commitAndPush(workspace, path.join(workspace, 'repo'), spawn);

    const networkOps = ['fetch', 'push', 'clone'];
    const networkCalls = spawn.calls.filter(c => networkOps.includes(c.args[0]));
@@ -67,28 +66,28 @@ describe('commitAndPush', () => {
  });

  it('cleans up askpass script after successful run', async () => {
-    await commitAndPush(workspace, makeSpawn());
+    await commitAndPush(workspace, path.join(workspace, 'repo'), makeSpawn());
    const leftover = fs.readdirSync(workspace).filter(f => f.endsWith('.git-askpass.sh'));
    assert.equal(leftover.length, 0, 'askpass script was not cleaned up');
  });

  it('cleans up askpass script even when git fails', async () => {
    const failSpawn = () => ({ status: 1, stdout: '', stderr: 'fatal: error', error: null });
-    await commitAndPush(workspace, failSpawn);
+    await commitAndPush(workspace, path.join(workspace, 'repo'), failSpawn);
    const leftover = fs.readdirSync(workspace).filter(f => f.endsWith('.git-askpass.sh'));
    assert.equal(leftover.length, 0, 'askpass script was not cleaned up after failure');
  });

  it('skips commit when status shows no changes', async () => {
    const spawn = makeSpawn({ status: () => ({ status: 0, stdout: '', stderr: '', error: null }) });
-    await commitAndPush(workspace, spawn);
+    await commitAndPush(workspace, path.join(workspace, 'repo'), spawn);
    const commitCalled = spawn.calls.some(c => c.args[0] === 'commit');
    assert.equal(commitCalled, false, 'commit should not run when there are no changes');
  });

  it('does not throw when git command fails', async () => {
    const failSpawn = () => ({ status: 1, stdout: '', stderr: 'fatal: error', error: null });
-    await assert.doesNotReject(() => commitAndPush(workspace, failSpawn));
+    await assert.doesNotReject(() => commitAndPush(workspace, path.join(workspace, 'repo'), failSpawn));
  });
 });

@@ -7,12 +7,11 @@ const headers = () => ({ Authorization: `token ${GITEA_TOKEN}`, 'Content-Type':
 const api = (path) => `${GITEA_SERVER_URL.replace(/\/$/, '')}/api/v1${path}`;

 /**
- * 取得 PR 的原始 Git Diff 內容。
- * 注意：回傳值未經路徑過濾，呼叫端須使用 filterDiff 排除敏感路徑（如 .gitea/）後再傳給 AI。
+ * 取得 PR 的 Git Diff 內容，已自動排除 .gitea/ 資料夾。
 */
 export async function getPRDiff() {
  const resp = await axios.get(api(`/repos/${GITEA_REPOSITORY}/pulls/${PR_NUMBER}.diff`), { headers: headers(), timeout: 60000, httpsAgent });
-  return resp.data;
+  return filterDiff(resp.data, ['.gitea/']);
 }

 /**
@@ -2,7 +2,7 @@ import fs from 'fs';
 import path from 'path';
 import { GITEA_REPOSITORY, PR_NUMBER, PR_HEAD_BRANCH, PR_BASE_BRANCH, getLLMConfig, FINDINGS_PATH, EXCLUSIONS_PATH } from './config.js';
 import { loadRoles, getRoleIntro } from './roles.js';
-import { getPRDiff, filterDiff, postComment } from './gitea.js';
+import { getPRDiff, postComment } from './gitea.js';
 import { analyzeWithRole, loadOldFindings, mergeFindings, sortByLevel, deduplicateWithAI, loadExclusions, applyExclusions, filterFalsePositivesWithAI } from './findings.js';
 import { saveFindings, postOldFindingsComment, postNewNonCriticalComment, postNewCriticalComments } from './comments.js';
 import { cloneRepo, commitAndPush } from './git.js';
@@ -47,18 +47,8 @@ async function main() {
    console.log(`  ⚠️  comment 發布失敗（繼續執行）: ${e.message}`);
  }

-  // Step2: 排除 .gitea/ 資料夾內的所有檔案
-  console.log('\n🗂️  Step2: Git Diff 過濾');
-  diff = filterDiff(diff, ['.gitea/']);
-  console.log(`  排除 .gitea/ 後 diff 長度: ${diff.length} 字元`);
-
-  if (!diff.trim()) {
-    console.log('  ⚠️  過濾後 diff 為空，無需審查');
-    process.exit(0);
-  }
-
-  // Step3: 各角色分析 diff 產生新 findings
-  console.log('\n📊 Step3: Findings 產生');
+  // Step2: 各角色分析 diff 產生新 findings
+  console.log('\n📊 Step2: Findings 產生');
  const results = await Promise.allSettled(roles.map(role => analyzeWithRole(role, diff)));
  const newFindings = [];
  for (let i = 0; i < results.length; i++) {
@@ -68,10 +58,10 @@ async function main() {
      console.log(`  ⚠️  [${roles[i].name}] 分析失敗（跳過）: ${results[i].reason?.message}`);
    }
  }
-  console.log(`  Step3 完成: 新 findings 總計 ${newFindings.length} 筆`);
+  console.log(`  Step2 完成: 新 findings 總計 ${newFindings.length} 筆`);

  // Step4: 讀取舊 findings，合併去重（含 AI 語意去重）
-  console.log('\n🔀 Step4: Findings 合併');
+  console.log('\n🔀 Step3: Findings 合併');
  // Clone repo 以讀取舊 findings 與排除清單
  let repoDir;
  try {
@@ -81,35 +71,35 @@ async function main() {
  }
  const oldFindings = loadOldFindings(repoDir || WORKSPACE);
  const mergedFindings = mergeFindings(oldFindings, newFindings);
-  console.log(`  Step4 merged findings total=${mergedFindings.length}`);
+  console.log(`  Step3 merged findings total=${mergedFindings.length}`);

-  console.log('\n🤖 Step4b: AI 語意去重');
+  console.log('\n🤖 Step3b: AI 語意去重');
  const deduped = await deduplicateWithAI(mergedFindings);
  const sorted = sortByLevel(deduped);
-  console.log(`  Step4b dedup findings total=${sorted.length} (critical=${sorted.filter(f=>f.level==='critical').length} warning=${sorted.filter(f=>f.level==='warning').length} info=${sorted.filter(f=>f.level==='info').length})`);
+  console.log(`  Step3b dedup findings total=${sorted.length} (critical=${sorted.filter(f=>f.level==='critical').length} warning=${sorted.filter(f=>f.level==='warning').length} info=${sorted.filter(f=>f.level==='info').length})`);

  // Step5: 讀取排除問題檔案，過濾 PR 問題表格，並請 AI 判斷誤報
-  console.log('\n🚫 Step5: AI 排除問題過濾');
+  console.log('\n🚫 Step4: AI 排除問題過濾');
  // 輸入至 findings 用於 AI 誤報過濾，exclusions 同時作為已知誤報參考
  const exclusions = loadExclusions(repoDir || WORKSPACE);
  const ruleFiltered = applyExclusions(sorted, exclusions);
  const filtered = await filterFalsePositivesWithAI(ruleFiltered, exclusions);
-  console.log(`  Step5 完成: findings total=${filtered.length}`);
+  console.log(`  Step4 完成: findings total=${filtered.length}`);

  // Step6: 寫入 findings.json，依序發布 comment
-  console.log('\n📝 Step6: Findings 寫入與 Comment 發布');
+  console.log('\n📝 Step5: Findings 寫入與 Comment 發布');
  saveFindings(WORKSPACE, filtered);
  try {
    await postOldFindingsComment(filtered);
    await postNewNonCriticalComment(filtered);
    await postNewCriticalComments(filtered);
-    console.log('  Step6 完成');
+    console.log('  Step5 完成');
  } catch (e) {
    console.log(`  ⚠️  comment 發布失敗（繼續執行）: ${e.message}`);
  }

  // Step7: 驗證 findings.json 與 exclusions.json 為合法 JSON
-  console.log('\n🔎 Step7: JSON 格式驗證');
+  console.log('\n🔎 Step6: JSON 格式驗證');
  for (const relPath of [FINDINGS_PATH, EXCLUSIONS_PATH]) {
    const fullPath = path.join(repoDir || WORKSPACE, relPath);
    if (!fs.existsSync(fullPath)) {
@@ -133,12 +123,12 @@ async function main() {
    }
  }

-  // Step8: commit/push findings.json 到來源分支
-  console.log('\n💾 Step8: 記憶區 Commit/Push');
-  await commitAndPush(WORKSPACE);
+  // Step7: commit/push findings.json 到來源分支
+  console.log('\n💾 Step7: 記憶區 Commit/Push');
+  await commitAndPush(WORKSPACE, repoDir);

  // Step9: 有 critical 問題則 exit 1
-  console.log('\n🚦 Step9: 嚴重問題檢查');
+  console.log('\n🚦 Step8: 嚴重問題檢查');
  const criticalCount = filtered.filter(f => f.level === 'critical').length;
  if (criticalCount > 0) {
    console.log(`  ❌ 發現 ${criticalCount} 個嚴重問題，workflow 結束（exit 1）`);
Author	SHA1	Message	Date
jiantw83	480a0693f7	docs: update TODO acceptance status	2026-05-13 06:12:23 +00:00
AI Review Bot	79506eb905	chore: update ai-review findings [skip ci]	2026-05-13 02:24:36 +00:00
jiantw83	8872e7366a	refactor: add performance improvement suggestion for filterDiff regex in exclusions.json	2026-05-13 02:23:30 +00:00
AI Review Bot	7616dd1816	chore: update ai-review findings [skip ci]	2026-05-13 02:18:00 +00:00
jiantw83	9bef365a32	refactor: enhance suggestions in exclusions.json for improved security and efficiency	2026-05-13 02:16:13 +00:00
AI Review Bot	21b3df6d79	chore: update ai-review findings [skip ci]	2026-05-13 01:47:56 +00:00
jiantw83	cc6345c32e	refactor: update commitAndPush function to accept repoDir parameter and adjust related tests	2026-05-13 01:46:27 +00:00
AI Review Bot	c758c99a28	chore: update ai-review findings [skip ci]	2026-05-13 01:42:23 +00:00
admin	505cf6d30d	Merge pull request 'refactor: optimize AI payload by reducing token usage and streamline findings structure' (#89 ) from feat/壓縮AI內容 into 整理程式碼 Reviewed-on: #89	2026-05-13 01:41:12 +00:00
AI Review Bot	c3e57ff442	chore: update ai-review findings [skip ci]	2026-05-13 01:40:58 +00:00