CITATION.cff
cff-version: 1.2.0
message: "If you use this software in your research, please cite it as below."
type: software
title: "PhaseGPT: Kuramoto Phase-Coupled Oscillator Attention in Transformers"
abstract: >
  A systematic investigation of Kuramoto phase-coupled oscillator mechanisms
  in transformer attention layers for language modeling. This project presents
  the first rigorous hyperparameter study of phase synchronization dynamics in
  transformers, achieving a 2.4% perplexity improvement on the Shakespeare
  dataset with the optimal configuration (Layer 7, 32 oscillators, coupling
  strength K=1.0). The research reveals a critical over-synchronization
  paradox and establishes a Goldilocks principle for oscillator count
  selection.
authors:
  - family-names: "Two"
    given-names: "Temple"
    email: "contact@templetwo.dev"
    affiliation: "Independent Researcher"
    orcid: "https://orcid.org/0000-0000-0000-0000"
repository-code: "https://github.com/templetwo/PhaseGPT"
url: "https://github.com/templetwo/PhaseGPT"
keywords:
  - transformers
  - attention mechanism
  - Kuramoto model
  - phase-coupled oscillators
  - synchronization dynamics
  - language modeling
  - neural networks
  - interpretability
  - hyperparameter optimization
license: MIT
version: 1.0.0
date-released: "2025-10-20"
doi: "10.17605/OSF.IO/ZQBC4"
identifiers:
  - type: doi
    value: "10.17605/OSF.IO/ZQBC4"
    description: "OSF project DOI"
  - type: url
    value: "https://osf.io/zqbc4/"
    description: "OSF project page"
preferred-citation:
  type: article
  title: "Optimizing Kuramoto Phase-Coupled Attention: A Systematic Hyperparameter Study"
  authors:
    - family-names: "Two"
      given-names: "Temple"
  year: 2025
  journal: "In Preparation"
  notes: >
    Phase A complete: systematic evaluation of 7 configurations identified
    the optimal parameters (Layer 7, 32 oscillators, K=1.0), achieving 4.85 PPL
    (a 2.4% improvement over baseline). Critical findings include a Goldilocks
    principle for oscillator count, catastrophic collapse at K=2.0, and an
    over-synchronization paradox (R=0.88). Phase B generalization experiments
    were preregistered but not executed due to resource constraints.
references:
  - type: conference-paper
    title: "Self-entrainment of a population of coupled non-linear oscillators"
    authors:
      - family-names: "Kuramoto"
        given-names: "Yoshiki"
    year: 1975
    conference:
      name: "International Symposium on Mathematical Problems in Theoretical Physics"
    scope: "Foundational work on the Kuramoto model"
  - type: conference-paper
    title: "Attention Is All You Need"
    authors:
      - family-names: "Vaswani"
        given-names: "Ashish"
    year: 2017
    conference:
      name: "NeurIPS"
    scope: "Transformer architecture foundation"
  - type: article
    title: "Language Models are Unsupervised Multitask Learners"
    authors:
      - family-names: "Radford"
        given-names: "Alec"
    year: 2019
    scope: "GPT-2 architecture"