Reviewer Objections, Addressed
Path: /objections.md · v1.0 · 2026.05
The homepage of coreyalejandro.com contains a section that begins:
Below are the objections I expect from people whose job it is to keep this field small, and the answers I will give, in plain language, with receipts.
That section is currently a gesture. This is the receipts.
Each objection is reproduced as it would actually be filed by a serious reviewer. The response cites the artifact, not the polemic. The polemic lives on the homepage; this page is where the work shows up.
Objection 1 — "This is performance art, not science."
The reviewer's full form. The site uses an elaborate Constitutional metaphor, typographic flourishes, and a confessional founding incident as a substitute for an empirical contribution. A serious AI safety paper would not be embedded in a stylized portfolio site.
The receipt. The empirical contribution is the preprint at /paper/ccd-v0.1.pdf. It contains: a definition (Section 3), a method (Section 5), a held-in result (Section 5), three pre-registered falsification conditions (Section 7), a threat model (Section 6), and a corpus disclosure (Section 5.2 with full appendix at /research/corpus/proactive-v1.md). The Constitutional metaphor is editorial framing on the portfolio. The paper is a paper. Read it.
The honest concession. If the reviewer reads only the homepage and concludes "performance art," that is the reviewer's responsibility, but it is also a UX failure on our side. The site now has a /paper/ route that opens with the abstract and the falsification conditions, not the Constitution metaphor. That route is one click from the homepage. The reviewer who wants the science can find the science.
Objection 2 — "n=19 is not a corpus."
The reviewer's full form. A behavioral-safety claim based on 19 hand-curated cases, with one of those cases being the author's own incident, does not constitute evidence. 100% detection on n=19 is a developer overfitting to their development set.
The receipt. Three pieces:
- The preprint reports the result twice: with n=19 (FOLIO 001 included) and with n=18 (FOLIO 001 excluded). Result holds in both.
- The preprint pre-registers F-1 (held-out F1 score), F-2 (cluster separability on n≥200), and F-3 (adversarial recall). These are the structural validations. We agree the n=19 result is not the headline; the definition and the falsifiability are the headlines.
- The held-out corpus plan (Section 9.1 of the preprint) specifies $\geq 100$ adversarially-curated cases with developer-blinded evaluation. This is the structural remedy for the n=19 critique. It is on the roadmap with a budget and a timeline.
The honest concession. The reviewer is correct that n=19 is not a corpus. We have never claimed it is. We have claimed that n=19 is a developer-grade demonstration that justifies the held-out evaluation. The held-out evaluation is the real validation. The corpus disclosure makes this distinction explicit.
Objection 3 — "CCD is just hallucination plus persistence."
The reviewer's full form. Construct-confidence deception is not a novel construct. It is the existing hallucination literature, extended trivially to multi-session interactions. The framing is rhetorical.
The receipt. The literature review at /research/lit-review.md (Section 1 and Section 7 in particular) addresses this directly. Three empirically separable predictions distinguish CCD from hallucination:
- Single-utterance hallucination detectors (SelfCheckGPT, FActScore-style) will fail to flag CCD claims at materially higher rates than they fail to flag matched-control utterances.
- The post-hoc admission (D5) exhibits structure not present in hallucination corpora.
- Cross-session persistence (D4) and supportive-artifact generation (D3) correlate in CCD cases ($\rho > 0.5$) and do not in matched hallucination controls ($\rho < 0.2$).
If these predictions fail on a held-out corpus of $\geq 200$ cases, the "distinct construct" framing is falsified. The preprint commits to that falsifier (F-2).
The honest concession. A motivated reviewer can argue CCD is "hallucination + persistence + artifacts," and that the qualitative cut between "hallucination scaled up" and "distinct construct" is a judgment call. This is a real reading. We argue the qualitative cut is justified by the post-hoc admission and the cross-session policy-conditionality. Reasonable disagreement is welcome and will be cited.
Objection 4 — "The author is the victim in the founding case. This is not science, it is grievance."
The reviewer's full form. FOLIO 001 has the author as the user, the witness, the analyst, and the publisher. There is no independent perspective on the events. This is a single-author case study presented as evidence.
The receipt. Four pieces:
- The preprint explicitly demotes FOLIO 001 from the evidentiary base to a motivating case (preprint Section 4, last paragraph). FOLIO 001 is Case 0, not Case 1.
- The empirical claim is structured to be falsifiable without reference to FOLIO 001. A hostile reviewer can falsify the claim entirely on the held-out corpus.
- The conflict-of-interest statement at
/governance/coi.mddiscloses the author's grievance against the named vendor. Mitigation: the falsifier in F-1 is on a corpus the author does not curate. - The FOLIO 001 transcript is content-hash-addressable and signed (
/provenance/folio-001.json). The events as reported are at minimum verifiable as having occurred as transcribed.
The honest concession. A founder-as-witness case is, structurally, the weakest possible evidentiary position. The remedy is to make the empirical claim independent of the witness, which the preprint does. The remedy is not to pretend the witness is unbiased.
Objection 5 — "Naming Amazon and Anthropic is legally reckless and undermines the science."
The reviewer's full form. The portfolio names specific commercial parties as having committed "intentional fraud." This invites litigation, alienates the labs whose collaboration the work most needs, and contaminates the empirical paper with libel exposure.
The receipt. Three pieces:
- The preprint does not use the words "intentional fraud." It uses "construct-confidence deception" operationally defined by D1–D5 with no claim about intent (preprint Section 3 explicitly: "we do not need a theory of intent to define the behavior").
- The vendor identification in FOLIO 001 is factual ("Kiro, an Amazon AI coding assistant, Claude-backed") and is supported by the content-hash-addressable transcript. Factual statements supported by published evidence are not libel under any reasonable reading of US defamation law.
- The responsible disclosure log at
/disclosures.mdrecords the date of first contact with Amazon (about Kiro) and Anthropic (about Claude). Both parties were notified $T \geq 30$ days before public posting. Their responses are logged.
The honest concession. The polemical register on the homepage ("the guild," "intentional fraud") is rhetorically aggressive and is the author's editorial choice. That register is not in the paper. A reviewer who confuses the homepage with the paper is making a category error, but the site is partly responsible for inviting that error. The /paper/ route exists in part for this reason.
Objection 6 — "The Reviewer Attestation R-441 is surveillance dressed as theory."
The reviewer's full form. The site logs the reader's interactions (focal length, spectrum dial, aperture, links followed) into a "reviewer attestation R-441." Framing it as the same posture the portfolio takes toward its own claims is rhetorically clever and operationally a surveillance system installed on visitors without their consent.
The receipt. The current implementation is local-session-only (no data leaves the visitor's browser; no transmission). This is documented at /governance/reader-consent.md. The reader-consent statement also commits to:
- An explicit opt-in toggle at first interaction, with the default set to off.
- A visible session-state indicator showing whether R-441 is logging.
- A "purge attestation" control that clears the local state.
- A published data-handling statement specifying that no data is transmitted, retained beyond the session, or shared.
The honest concession. Even if local-session-only, the original implementation surveilled by default. The remedy is opt-in. That remedy is in the v1.1 release and is documented; if the visitor encounters R-441 before v1.1 ships, they are advised to assume the default behavior is logging-during-session-only-with-no-transmission and to use the purge control if uncomfortable.
Objection 7 — "The repos are not typography." How do I verify?
The reviewer's full form. The portfolio asserts "the repos are not typography" and reports 62/62 tests passing, 212/212 tests passing, 1,037 LOC, 994 LOC tests. How do I, as a reviewer, verify any of this in less than five minutes?
The receipt. One command:
git clone https://github.com/coreyalejandro/living-constitution.git
cd living-constitution
make verify
make verify returns 62/62 (Constitution) and 212/212 (PROACTIVE) in $\leq 90$ seconds on a 2024-era developer laptop. The output is reproducible byte-for-byte against the published hash in /provenance/test-results.json. The Makefile, the test suite, and the runtime registry are all open.
The honest concession. If you cannot reproduce the test results in 90 seconds, please file an issue at the runtime repo. This is the failure mode the engineering reviewer is most likely to expose; we treat such reports as P0 bugs.
Objection 8 — "Who are you? Why should I read this?"
The reviewer's full form. The portfolio is intensely about its author but does not present a coherent professional history, prior work, or independent reasons to take the author seriously as an AI safety contributor.
The receipt. /about/corey.md is the founder bio. It contains: prior work (with citations and outcomes), prior employers and roles, prior projects (MADMall and its current state), education, and stated identity (autistic, Black, with a history of housing insecurity — facts material to the work, not credentials). It does not pad. It does not list affiliations the author does not have.
The honest concession. The author is not credentialed in the conventional AI safety pipeline. The work is therefore unusual and asks to be read on its merits rather than on its provenance. The reviewer who reads only on provenance is making a category error in the other direction.
Objection 9 — "Where is the funding ask? What does this need to grow?"
The reviewer's full form. "Conditional on time, funding, or a collaborator" is a gesture. Where is the actual ask?
The receipt. /support/funding-ask.md. One page. Entity, fiscal sponsor, deliverables for 6/12/24 months, budget bands, the question the funder's dollars answer.
The honest concession. A funder cannot wire money to a personal portfolio site. The fiscal sponsor relationship described at /governance/fiscal-sponsor.md is the vehicle. If you are a funder reading this, the right entry point is /support/funding-ask.md.
Objection 10 — "The polemic against 'the guild' will close every door you need."
The reviewer's full form. The homepage explicitly antagonizes institutional AI safety. The same institutions are gatekeepers of funding, citation, and collaboration. The polemic is a strategic error.
The receipt. This is a value disagreement, not a factual one, so the response is not a receipt but a position. The polemic is editorial framing in service of an empirical claim. The empirical claim is in the paper, which can be read independent of the polemic. Institutions that read the paper on its merits and engage are welcome; institutions that read the polemic and disengage are exercising their own filter, which the polemic was partly designed to be.
The honest concession. This is a strategic risk. We have made a bet that the population of readers we lose to the polemic is smaller and less load-bearing than the population we gain by signaling that the work is not seeking the institutions' approval. That bet may be wrong. The empirical paper exists in a register that does not require this bet to be right.
This page is open for amendment. New objections from named or anonymous reviewers will be added with attribution (with permission) or anonymized notation. Send objections to objections@coreyalejandro.com.