Update thesis
@ -51,7 +51,7 @@ image:
|
||||
|
||||
<!-- Main Page Body -->
|
||||
<!--300 word version in comments-->
|
||||
<!-- Large systems of interacting quantum objects can give rise to a rich array of emergent behaviours. However this full description is notoriously difficult to use, defying many of the established theoretical techniques within the field. Luckily, we don't usually need to consider the full quantum many-body description. Strongly correlated materials (SCMs), however, do require the full description. Enter exactly solvable models, these can model SCMs but are much more amenable to standard theoretical techniques. This thesis focuses on two such exactly solvable models.
|
||||
<!-- Large systems of interacting quantum objects can give rise to a rich array of emergent behaviours. However, this full description is notoriously difficult to use, defying many of the established theoretical techniques within the field. Luckily, we don't usually need to consider the full quantum many-body description. Strongly correlated materials (SCMs), however, do require the full description. Enter exactly solvable models, these can model SCMs but are much more amenable to standard theoretical techniques. This thesis focuses on two such exactly solvable models.
|
||||
|
||||
The first, the Falicov-Kimball (FK) model is an exactly solvable limit of the famous Hubbard model which describes itinerant fermions interacting with a classical Ising background field. We will define a generalised FK model in 1D with long-range interactions. This model which shows a rich ground and thermodynamic phase diagram, similar to its higher dimensional cousins. We use an exact Markov Chain Monte Carlo method to map the phase diagram and compute the energy resolved localisation properties of the fermions. This allows us to look at how the move to 1D affects the physics of the model.
|
||||
|
||||
|
@ -40,7 +40,7 @@ image:
|
||||
<nav aria-label="Table of Contents" class="page-table-of-contents">
|
||||
<ul>
|
||||
<li><a href="#chap:1-introduction" id="toc-chap:1-introduction">1 Introduction</a></li>
|
||||
<li><a href="#interacting-quantum-many-body-systems" id="toc-interacting-quantum-many-body-systems">Interacting Quantum Many Body Systems</a></li>
|
||||
<li><a href="#interacting-quantum-many-body-systems" id="toc-interacting-quantum-many-body-systems">Interacting Quantum Many-Body Systems</a></li>
|
||||
<li><a href="#mott-insulators" id="toc-mott-insulators">Mott Insulators</a>
|
||||
<ul>
|
||||
<li><a href="#intro-the-fk-model" id="toc-intro-the-fk-model">The Falicov-Kimball Model</a></li>
|
||||
@ -60,7 +60,7 @@ image:
|
||||
<!-- <nav id="TOC" role="doc-toc">
|
||||
<ul>
|
||||
<li><a href="#chap:1-introduction" id="toc-chap:1-introduction">1 Introduction</a></li>
|
||||
<li><a href="#interacting-quantum-many-body-systems" id="toc-interacting-quantum-many-body-systems">Interacting Quantum Many Body Systems</a></li>
|
||||
<li><a href="#interacting-quantum-many-body-systems" id="toc-interacting-quantum-many-body-systems">Interacting Quantum Many-Body Systems</a></li>
|
||||
<li><a href="#mott-insulators" id="toc-mott-insulators">Mott Insulators</a>
|
||||
<ul>
|
||||
<li><a href="#intro-the-fk-model" id="toc-intro-the-fk-model">The Falicov-Kimball Model</a></li>
|
||||
@ -80,36 +80,35 @@ image:
|
||||
<h1>1 Introduction</h1>
|
||||
</section>
|
||||
<section id="interacting-quantum-many-body-systems" class="level1">
|
||||
<h1>Interacting Quantum Many Body Systems</h1>
|
||||
<p>When you take many objects and let them interact together, when we describe the behaviours that arise it is often easier to talks in terms of the group rather than the behaviour of the individual objects. A flock of starlings like that of fig. <a href="#fig:Studland_Starlings">1</a> is a good example. If you were to sit and watch a flock like this, you’d see that it has a distinct outline, that waves of density will sometimes propagate through the closely packed birds and that the flock seems to respond to predators as a distinct object. The natural description of this phenomenon is in terms of the flock, not the individual birds.</p>
|
||||
<p>A flock is an <em>emergent phenomenon</em>. The starlings are only interacting with their immediate six or seven neighbours <span class="citation" data-cites="king2012murmurations balleriniInteractionRulingAnimal2008"> [<a href="#ref-king2012murmurations" role="doc-biblioref">1</a>,<a href="#ref-balleriniInteractionRulingAnimal2008" role="doc-biblioref">2</a>]</span>, what a physicist would call a <em>local interaction</em>. There is much philosophical debate about how exactly to define emergence <span class="citation" data-cites="andersonMoreDifferent1972 kivelsonDefiningEmergencePhysics2016"> [<a href="#ref-andersonMoreDifferent1972" role="doc-biblioref">3</a>,<a href="#ref-kivelsonDefiningEmergencePhysics2016" role="doc-biblioref">4</a>]</span>. For our purposes, it is enough to say that emergence is the fact that the aggregate behaviour of many interacting objects may necessitate a radically different description from that of the individual objects.</p>
|
||||
<h1>Interacting Quantum Many-Body Systems</h1>
|
||||
<p>When you take many objects and let them interact together, complex behaviours can emerge. It is often easier to describe these behaviours in terms of properties of the group rather than properties of the individual objects. A flock of starlings like that of fig. <a href="#fig:Studland_Starlings">1</a> is a good example. If you were to sit and watch a flock like this, you’d see that it has a distinct outline, that waves of density will sometimes propagate through the closely packed birds and that the flock seems to respond to predators as a distinct object. The natural description of this phenomenon is in terms of the flock, not the individual birds.</p>
|
||||
<p>A flock is an <em>emergent phenomenon</em>. The starlings are only interacting with their immediate six or seven neighbours <span class="citation" data-cites="king2012murmurations balleriniInteractionRulingAnimal2008"> [<a href="#ref-king2012murmurations" role="doc-biblioref">1</a>,<a href="#ref-balleriniInteractionRulingAnimal2008" role="doc-biblioref">2</a>]</span>, what a physicist would call a <em>local interaction</em>. There is much philosophical debate about how exactly to define emergence <span class="citation" data-cites="andersonMoreDifferent1972 kivelsonDefiningEmergencePhysics2016"> [<a href="#ref-andersonMoreDifferent1972" role="doc-biblioref">3</a>,<a href="#ref-kivelsonDefiningEmergencePhysics2016" role="doc-biblioref">4</a>]</span> but, for our purposes, it is enough to say that emergence is the fact that the aggregate behaviour of many interacting objects may necessitate a radically different description from that of the individual objects.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/intro_chapter/Studland_Starlings.jpeg" id="fig-Studland_Starlings" data-short-caption="A murmuration of Starlings" style="width:100.0%" alt="Figure 1: A murmuration of starlings. Dorset, UK. Credit Tanya Hart, “Studland Starlings”, 2017, CC BY-SA 3.0" />
|
||||
<figcaption aria-hidden="true">Figure 1: A murmuration of starlings. Dorset, UK. Credit <a href="https://twitter.com/arripay">Tanya Hart</a>, “Studland Starlings”, 2017, <a href="https://creativecommons.org/licenses/by-sa/3.0/deed.en">CC BY-SA 3.0</a></figcaption>
|
||||
</figure>
|
||||
<p>To give an example closer to the topic at hand, our understanding of thermodynamics began with bulk properties like heat, energy, pressure and temperature <span class="citation" data-cites="saslowHistoryThermodynamicsMissing2020"> [<a href="#ref-saslowHistoryThermodynamicsMissing2020" role="doc-biblioref">5</a>]</span>. It was only later that we gained an understanding of how these properties emerge from microscopic interactions between very large numbers of particles <span class="citation" data-cites="flammHistoryOutlookStatistical1998"> [<a href="#ref-flammHistoryOutlookStatistical1998" role="doc-biblioref">6</a>]</span>.</p>
|
||||
<p>At its heart, condensed matter is the study of the behaviours that can emerge from large numbers of interacting quantum objects at low energy. When these three properties are present together (a large number of objects, those objects being quantum and the presence interactions between the objects), we call it an interacting quantum many body system. From these three ingredients, nature builds all manner of weird and wonderful materials.</p>
|
||||
<p>Historically, we first made headway by ignoring interactions and quantum properties and looking at purely many-body systems. The ideal gas law and the Drude classical electron gas <span class="citation" data-cites="ashcroftSolidStatePhysics1976"> [<a href="#ref-ashcroftSolidStatePhysics1976" role="doc-biblioref">7</a>]</span> are good examples. Including interactions leads to the Ising model <span class="citation" data-cites="isingBeitragZurTheorie1925"> [<a href="#ref-isingBeitragZurTheorie1925" role="doc-biblioref">8</a>]</span>, the Landau theory <span class="citation" data-cites="landauTheoryPhaseTransitions1937"> [<a href="#ref-landauTheoryPhaseTransitions1937" role="doc-biblioref">9</a>]</span> and the classical theory of phase transitions <span class="citation" data-cites="jaegerEhrenfestClassificationPhase1998"> [<a href="#ref-jaegerEhrenfestClassificationPhase1998" role="doc-biblioref">10</a>]</span>. In contrast, condensed matter theory got its start in quantum many-body theory where the only electron-electron interaction considered is the Pauli exclusion principle. Bloch’s theorem <span class="citation" data-cites="blochÜberQuantenmechanikElektronen1929"> [<a href="#ref-blochÜberQuantenmechanikElektronen1929" role="doc-biblioref">11</a>]</span>, the core result of band theory, predicted the properties of non-interacting electrons in crystal lattices. It predicted, in particular, that band insulators arise when the electrons bands are filled, leaving the fermi level in a bandgap <span class="citation" data-cites="ashcroftSolidStatePhysics1976"> [<a href="#ref-ashcroftSolidStatePhysics1976" role="doc-biblioref">7</a>]</span>. In the same vein, advances were made in understanding the quantum origins of magnetism, including ferromagnetism and antiferromagnetism <span class="citation" data-cites="MagnetismCondensedMatter"> [<a href="#ref-MagnetismCondensedMatter" role="doc-biblioref">12</a>]</span>.</p>
|
||||
<p>The development of Landau-Fermi liquid theory explained why band theory works so well even where an analysis of the relevant energies suggests that it should not <span class="citation" data-cites="wenQuantumFieldTheory2007"> [<a href="#ref-wenQuantumFieldTheory2007" role="doc-biblioref">13</a>]</span>. Landau Fermi Liquid theory demonstrates that, in many cases where electron-electron interactions are significant, the system can still be described in terms of generalised non-interacting quasiparticles. This happens when the properties of the quasiparticles in the interacting system can be smoothly connected to the free fermions of the non-interacting system.</p>
|
||||
<p>However, there are systems where even Landau-Fermi liquid theory fails. An effective theoretical description of these systems must include electron-electron correlations. They are thus called strongly correlated materials <span class="citation" data-cites="morosanStronglyCorrelatedMaterials2012"> [<a href="#ref-morosanStronglyCorrelatedMaterials2012" role="doc-biblioref">14</a>]</span>. The canonical examples are superconductivity <span class="citation" data-cites="MicroscopicTheorySuperconductivity"> [<a href="#ref-MicroscopicTheorySuperconductivity" role="doc-biblioref">15</a>]</span>, the fractional quantum hall effect <span class="citation" data-cites="feldmanFractionalChargeFractional2021"> [<a href="#ref-feldmanFractionalChargeFractional2021" role="doc-biblioref">16</a>]</span> and the Mott insulators <span class="citation" data-cites="mottBasisElectronTheory1949 fisherMottInsulatorsSpin1999"> [<a href="#ref-mottBasisElectronTheory1949" role="doc-biblioref">17</a>,<a href="#ref-fisherMottInsulatorsSpin1999" role="doc-biblioref">18</a>]</span>. We’ll start by looking at the latter but shall see that there are many links between the three topics.</p>
|
||||
<p>To give an example closer to the topic at hand, our understanding of thermodynamics began with bulk properties like heat, pressure, energy and temperature <span class="citation" data-cites="saslowHistoryThermodynamicsMissing2020"> [<a href="#ref-saslowHistoryThermodynamicsMissing2020" role="doc-biblioref">5</a>]</span>. It was only later that we gained an understanding of how these properties emerge from microscopic interactions between very large numbers of particles <span class="citation" data-cites="flammHistoryOutlookStatistical1998"> [<a href="#ref-flammHistoryOutlookStatistical1998" role="doc-biblioref">6</a>]</span>.</p>
|
||||
<p>At its heart, condensed matter is the study of the behaviours that can emerge from large numbers of interacting quantum objects at low energy. From these three ingredients,: a large number of objects, those objects being quantum and the presence of interactions between the objects, nature builds all manner of weird and wonderful things, see fig. <a href="#fig:venn_diagram">2</a> for examples. When these three properties are all present and important, we call it an interacting quantum many-body system. Such systems will be the focus of this thesis.</p>
|
||||
<p>Historically, we first made headway by ignoring interactions and quantum properties and looking at purely many-body systems. The ideal gas law and the Drude classical electron gas <span class="citation" data-cites="ashcroftSolidStatePhysics1976"> [<a href="#ref-ashcroftSolidStatePhysics1976" role="doc-biblioref">7</a>]</span> are good examples. Including interactions leads to the Ising model <span class="citation" data-cites="isingBeitragZurTheorie1925"> [<a href="#ref-isingBeitragZurTheorie1925" role="doc-biblioref">8</a>]</span>, Landau theory <span class="citation" data-cites="landauTheoryPhaseTransitions1937"> [<a href="#ref-landauTheoryPhaseTransitions1937" role="doc-biblioref">9</a>]</span> and the classical theory of phase transitions <span class="citation" data-cites="jaegerEhrenfestClassificationPhase1998"> [<a href="#ref-jaegerEhrenfestClassificationPhase1998" role="doc-biblioref">10</a>]</span>. In contrast, condensed matter theory got its start in quantum many-body theory where the only electron-electron interaction considered is the Pauli exclusion principle <a href="#fn1" class="footnote-ref" id="fnref1" role="doc-noteref"><sup>1</sup></a>. Bloch’s theorem <span class="citation" data-cites="blochÜberQuantenmechanikElektronen1929"> [<a href="#ref-blochÜberQuantenmechanikElektronen1929" role="doc-biblioref">11</a>]</span>, the core result of band theory, predicted the properties of non-interacting electrons in crystal lattices. In particular, it predicted that band insulators arise when the electrons bands are filled, leaving the fermi level in a bandgap <span class="citation" data-cites="ashcroftSolidStatePhysics1976"> [<a href="#ref-ashcroftSolidStatePhysics1976" role="doc-biblioref">7</a>]</span>. In the same vein, advances were made in understanding the quantum origins of magnetism, including ferromagnetism and antiferromagnetism <span class="citation" data-cites="MagnetismCondensedMatter"> [<a href="#ref-MagnetismCondensedMatter" role="doc-biblioref">12</a>]</span>.</p>
|
||||
<p>The development of Landau-Fermi liquid theory explained why band theory works so well even when an analysis of the relevant energies suggests that it should not <span class="citation" data-cites="wenQuantumFieldTheory2007"> [<a href="#ref-wenQuantumFieldTheory2007" role="doc-biblioref">13</a>]</span>. Landau-Fermi liquid theory demonstrates that, in many cases where electron-electron interactions are significant, the system can still be described in terms of generalised non-interacting quasiparticles. This description is applicable when the properties of the quasiparticles in the interacting system can be smoothly connected to the free fermions of the non-interacting system.</p>
|
||||
<p>However, there are systems where even Landau-Fermi liquid theory fails. An effective theoretical description of these systems must include electron-electron correlations. They are thus called strongly correlated materials <span class="citation" data-cites="morosanStronglyCorrelatedMaterials2012"> [<a href="#ref-morosanStronglyCorrelatedMaterials2012" role="doc-biblioref">14</a>]</span>. The canonical examples are superconductivity <span class="citation" data-cites="MicroscopicTheorySuperconductivity"> [<a href="#ref-MicroscopicTheorySuperconductivity" role="doc-biblioref">15</a>]</span>, the fractional quantum Hall effect <span class="citation" data-cites="feldmanFractionalChargeFractional2021"> [<a href="#ref-feldmanFractionalChargeFractional2021" role="doc-biblioref">16</a>]</span> and the Mott insulators <span class="citation" data-cites="mottBasisElectronTheory1949 fisherMottInsulatorsSpin1999"> [<a href="#ref-mottBasisElectronTheory1949" role="doc-biblioref">17</a>,<a href="#ref-fisherMottInsulatorsSpin1999" role="doc-biblioref">18</a>]</span>. We’ll start by looking at the latter but shall see that there are many links between the three topics.</p>
|
||||
</section>
|
||||
<section id="mott-insulators" class="level1">
|
||||
<h1>Mott Insulators</h1>
|
||||
<p>Mott Insulators (MIs) are remarkable because their electrical insulator properties come not from having filled bands but from electron-electron interactions other than Pauli exclusion. Electrical conductivity, the bulk movement of electrons, requires both that there are electronic states very close in energy to the ground state and that those states are delocalised so that they can contribute to macroscopic transport. Band insulators are systems whose Fermi level falls within a gap in the density of states and thus fail the first criteria. Band insulators derive their character from the characteristics of the underlying lattice. A third kind of insulator, the Anderson insulators, have only localised electronic states near the fermi level and therefore fail the second criteria. In a later section, I will discuss Anderson insulators and the disorder that drives them.</p>
|
||||
<p>Both band and Anderson insulators occur without electron-electron interactions. MIs, by contrast, require a many body picture to understand and thus elude band theory and single-particle methods.</p>
|
||||
<p>Mott Insulators (MIs) are remarkable because their electrical insulator properties come not from having filled bands but from electron-electron interactions other than Pauli exclusion. Electrical conductivity, the bulk movement of electrons, requires both that there are electronic states very close in energy to the ground state and that those states are delocalised so that they can contribute to macroscopic transport. Band insulators are systems whose Fermi level falls within a gap in the density of states: they fail the first criteria. Band insulators derive their insulating character from the characteristics of the underlying lattice. Another class of insulator, the Anderson insulators, are disordered so only have localised electronic states near the fermi level. They therefore fail the second criteria. In a later section, I will discuss Anderson insulators and the disorder that drives them. Both band and Anderson insulators occur without electron-electron interactions. MIs, by contrast, require a many-body picture to understand and thus elude band theory and single-particle methods.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/intro_chapter/venn_diagram.svg" id="fig-venn_diagram" data-short-caption="Interacting Quantum Many Body Systems Venn Diagram" style="width:100.0%" alt="Figure 2: Three key adjectives. Many Body: systems considered in the limit of large numbers of particles. Quantum: objects whose behaviour requires quantum mechanics to describe accurately. Interacting: the constituent particles of the system affect one another via forces, either directly or indirectly. When taken together, these three properties can give rise to strongly correlated materials." />
|
||||
<figcaption aria-hidden="true">Figure 2: Three key adjectives. <em>Many Body</em>: systems considered in the limit of large numbers of particles. <em>Quantum</em>: objects whose behaviour requires quantum mechanics to describe accurately. <em>Interacting</em>: the constituent particles of the system affect one another via forces, either directly or indirectly. When taken together, these three properties can give rise to strongly correlated materials.</figcaption>
|
||||
<img src="/assets/thesis/intro_chapter/venn_diagram.svg" id="fig-venn_diagram" data-short-caption="Interacting Quantum Many-Body Systems Venn Diagram" style="width:100.0%" alt="Figure 2: Three key adjectives. Many-Body: systems considered in the limit of large numbers of particles. Quantum: objects whose behaviour requires quantum mechanics to describe accurately. Interacting: the constituent particles of the system affect one another via forces, either directly or indirectly. When taken together, these three properties can give rise to strongly correlated materials." />
|
||||
<figcaption aria-hidden="true">Figure 2: Three key adjectives. <em>Many-Body</em>: systems considered in the limit of large numbers of particles. <em>Quantum</em>: objects whose behaviour requires quantum mechanics to describe accurately. <em>Interacting</em>: the constituent particles of the system affect one another via forces, either directly or indirectly. When taken together, these three properties can give rise to strongly correlated materials.</figcaption>
|
||||
</figure>
|
||||
<p>The theory of MIs developed out of the observation that band theory erroneously predicts that many transition metal oxides are conductive <span class="citation" data-cites="boerSemiconductorsPartiallyCompletely1937"> [<a href="#ref-boerSemiconductorsPartiallyCompletely1937" role="doc-biblioref">19</a>]</span>. It was suggested that electron-electron interactions were the cause of this effect <span class="citation" data-cites="mottDiscussionPaperBoer1937"> [<a href="#ref-mottDiscussionPaperBoer1937" role="doc-biblioref">20</a>]</span>. Interest grew with the discovery of high temperature superconductivity in the cuprates in 1986 <span class="citation" data-cites="bednorzPossibleHighTcSuperconductivity1986"> [<a href="#ref-bednorzPossibleHighTcSuperconductivity1986" role="doc-biblioref">21</a>]</span> which is believed to arise as the result of a doped MI state <span class="citation" data-cites="leeDopingMottInsulator2006"> [<a href="#ref-leeDopingMottInsulator2006" role="doc-biblioref">22</a>]</span>.</p>
|
||||
<p>The canonical toy model of the MI is the Hubbard model <span class="citation" data-cites="gutzwillerEffectCorrelationFerromagnetism1963 kanamoriElectronCorrelationFerromagnetism1963 hubbardj.ElectronCorrelationsNarrow1963"> [<a href="#ref-gutzwillerEffectCorrelationFerromagnetism1963" role="doc-biblioref">23</a>–<a href="#ref-hubbardj.ElectronCorrelationsNarrow1963" role="doc-biblioref">25</a>]</span> of spin-<span class="math inline">\(1/2\)</span> fermions hopping on the lattice with hopping parameter <span class="math inline">\(t\)</span> and electron-electron repulsion <span class="math inline">\(U\)</span></p>
|
||||
<p><span class="math display">\[ H_{\mathrm{H}} = -t \sum_{\langle i,j \rangle \alpha} c^\dagger_{i\alpha} c_{j\alpha} + U \sum_i n_{i\uparrow} n_{i\downarrow} - \mu \sum_{i,\alpha} n_{i\alpha},\]</span></p>
|
||||
<p>The theory of MIs developed out of the observation that band theory erroneously predicts that many transition metal oxides are conductive <span class="citation" data-cites="boerSemiconductorsPartiallyCompletely1937"> [<a href="#ref-boerSemiconductorsPartiallyCompletely1937" role="doc-biblioref">19</a>]</span>. It was suggested that electron-electron interactions were the cause of this effect <span class="citation" data-cites="mottDiscussionPaperBoer1937"> [<a href="#ref-mottDiscussionPaperBoer1937" role="doc-biblioref">20</a>]</span>. Interest grew further with the discovery of high temperature superconductivity in the cuprates in 1986 <span class="citation" data-cites="bednorzPossibleHighTcSuperconductivity1986"> [<a href="#ref-bednorzPossibleHighTcSuperconductivity1986" role="doc-biblioref">21</a>]</span> which is believed to arise as the result of a doped MI state <span class="citation" data-cites="leeDopingMottInsulator2006"> [<a href="#ref-leeDopingMottInsulator2006" role="doc-biblioref">22</a>]</span>.</p>
|
||||
<p>The canonical toy model of the MI is the Hubbard model <span class="citation" data-cites="gutzwillerEffectCorrelationFerromagnetism1963 kanamoriElectronCorrelationFerromagnetism1963 hubbardj.ElectronCorrelationsNarrow1963"> [<a href="#ref-gutzwillerEffectCorrelationFerromagnetism1963" role="doc-biblioref">23</a>–<a href="#ref-hubbardj.ElectronCorrelationsNarrow1963" role="doc-biblioref">25</a>]</span> of spin-<span class="math inline">\(1/2\)</span> fermions hopping on the lattice with hopping parameter <span class="math inline">\(t\)</span> and electron-electron repulsion <span class="math inline">\(U\)</span>, it reads</p>
|
||||
<p><span class="math display">\[ H_{\mathrm{H}} = -t \sum_{\langle i,j \rangle \alpha} c^\dagger_{i\alpha} c^{\phantom{\dagger}}_{j\alpha} + U \sum_i n_{i\uparrow} n_{i\downarrow} - \mu \sum_{i,\alpha} n_{i\alpha},\]</span></p>
|
||||
<p>where <span class="math inline">\(c^\dagger_{i\alpha}\)</span> creates a spin <span class="math inline">\(\alpha\)</span> electron at site <span class="math inline">\(i\)</span> and the number operator <span class="math inline">\(n_{i\alpha}\)</span> measures the number of electrons with spin <span class="math inline">\(\alpha\)</span> at site <span class="math inline">\(i\)</span>. The sum runs over lattice neighbours <span class="math inline">\(\langle i,j \rangle\)</span> including both <span class="math inline">\(\langle i,j \rangle\)</span> and <span class="math inline">\(\langle j,i \rangle\)</span> so that the model is Hermitian.</p>
|
||||
<p>In the non-interacting limit <span class="math inline">\(U << t\)</span>, the model reduces to free fermions and the many-body ground state is a separable product of Bloch waves filled up to the Fermi level. On the other hand, the ground state in the interacting limit <span class="math inline">\(U >> t\)</span> is a direct product of the local Hilbert spaces <span class="math inline">\(|0\rangle, |\uparrow\rangle, |\downarrow\rangle, |\uparrow\downarrow\rangle\)</span>. At half filling, one electron per site, each site becomes a <em>local moment</em> in the reduced Hilbert space <span class="math inline">\(|\uparrow\rangle, |\downarrow\rangle\)</span> and thus acts like a spin-<span class="math inline">\(1/2\)</span> <span class="citation" data-cites="hubbardElectronCorrelationsNarrow1964"> [<a href="#ref-hubbardElectronCorrelationsNarrow1964" role="doc-biblioref">26</a>]</span>.</p>
|
||||
<p>The Mott insulating phase occurs at half filling <span class="math inline">\(\mu = \tfrac{U}{2}\)</span>. Here the model can be rewritten in a symmetric form</p>
|
||||
<p><span class="math display">\[ H_{\mathrm{H}} = -t \sum_{\langle i,j \rangle \alpha} c^\dagger_{i\alpha} c_{j\alpha} + U \sum_i (n_{i\uparrow} - \tfrac{1}{2})(n_{i\downarrow} - \tfrac{1}{2}).\]</span></p>
|
||||
<p>The basic reason that the half filled state is insulating seems trivial. Any excitation must include states of double occupancy that cost energy <span class="math inline">\(U\)</span>. Hence, the system has a finite bandgap and is an interaction-driven MI. Depending on the lattice, the local moments may then order antiferromagnetically. Originally it was proposed that this Antiferromagnetic (AFM) order was actually the reason for the insulating behaviour. This would make sense since AFM order doubles the unit cell and can turn a system into a band insulator with an even number of electrons per unit cell <span class="citation" data-cites="mottMetalInsulatorTransitions1990"> [<a href="#ref-mottMetalInsulatorTransitions1990" role="doc-biblioref">27</a>]</span>. However, MIs have been found without magnetic order <span class="citation" data-cites="law1TTaS2QuantumSpin2017 ribakGaplessExcitationsGround2017"> [<a href="#ref-law1TTaS2QuantumSpin2017" role="doc-biblioref">28</a>,<a href="#ref-ribakGaplessExcitationsGround2017" role="doc-biblioref">29</a>]</span>. Instead, the local moments may form a highly entangled state known as a Quantum Spin Liquid (QSL), which will be discussed shortly.</p>
|
||||
<p>Various theoretical treatments of the Hubbard model have been made, including those based on Fermi liquid theory, mean field treatments, the local density approximation <span class="citation" data-cites="slaterMagneticEffectsHartreeFock1951"> [<a href="#ref-slaterMagneticEffectsHartreeFock1951" role="doc-biblioref">30</a>]</span>, dynamical mean-field theory <span class="citation" data-cites="greinerQuantumPhaseTransition2002"> [<a href="#ref-greinerQuantumPhaseTransition2002" role="doc-biblioref">31</a>]</span>, density matrix renormalisation group methods <span class="citation" data-cites="hallbergNewTrendsDensity2006 schollwöckDensitymatrixRenormalizationGroup2005 whiteDensityMatrixFormulation1992"> [<a href="#ref-hallbergNewTrendsDensity2006" role="doc-biblioref">32</a>–<a href="#ref-whiteDensityMatrixFormulation1992" role="doc-biblioref">34</a>]</span> and Markov chain Monte Carlo <span class="citation" data-cites="blankenbeclerMonteCarloCalculations1981 hirschDiscreteHubbardStratonovichTransformation1983 whiteNumericalStudyTwodimensional1989"> [<a href="#ref-blankenbeclerMonteCarloCalculations1981" role="doc-biblioref">35</a>–<a href="#ref-whiteNumericalStudyTwodimensional1989" role="doc-biblioref">37</a>]</span>. None of these approaches are perfect. Strong correlations are poorly described by Fermi liquid theory and LDA approaches while mean field approximations do poorly in low dimensional systems. This theoretical difficulty has made the Hubbard model a target for cold atom simulations <span class="citation" data-cites="mazurenkoColdatomFermiHubbard2017"> [<a href="#ref-mazurenkoColdatomFermiHubbard2017" role="doc-biblioref">38</a>]</span>.</p>
|
||||
<p>In the non-interacting limit <span class="math inline">\(U \ll t\)</span>, the model reduces to free fermions and the many-body ground state is a separable product of Bloch waves filled up to the Fermi level. On the other hand, the ground state in the interacting limit <span class="math inline">\(U \gg t\)</span> is a direct product of the local Hilbert spaces <span class="math inline">\(|0\rangle, |\uparrow\rangle, |\downarrow\rangle, |\uparrow\downarrow\rangle\)</span>. At half-filling, one electron per site, each site becomes a <em>local moment</em> in the reduced Hilbert space <span class="math inline">\(|\uparrow\rangle, |\downarrow\rangle\)</span> and thus acts like a spin-<span class="math inline">\(1/2\)</span> <span class="citation" data-cites="hubbardElectronCorrelationsNarrow1964"> [<a href="#ref-hubbardElectronCorrelationsNarrow1964" role="doc-biblioref">26</a>]</span>.</p>
|
||||
<p>The Mott insulating phase occurs at half-filling <span class="math inline">\(\mu = \tfrac{U}{2}\)</span>. Here the model can be rewritten in a symmetric form</p>
|
||||
<p><span class="math display">\[ H_{\mathrm{H}} = -t \sum_{\langle i,j \rangle \alpha} c^\dagger_{i\alpha} c^{\phantom{\dagger}}_{j\alpha} + U \sum_i (n_{i\uparrow} - \tfrac{1}{2})(n_{i\downarrow} - \tfrac{1}{2}).\]</span></p>
|
||||
<p>The basic reason that the half-filled state is insulating seems trivial. Any excitation must include states of double occupancy that cost energy <span class="math inline">\(U\)</span>. Hence, the system has a finite bandgap and is an interaction-driven MI. Depending on the lattice, the local moments may then order antiferromagnetically. Originally it was proposed that this antiferromagnetic (AFM) order was actually the reason for the insulating behaviour. This would make sense since AFM order doubles the unit cell and can turn a system into a band insulator with an even number of electrons per unit cell <span class="citation" data-cites="mottMetalInsulatorTransitions1990"> [<a href="#ref-mottMetalInsulatorTransitions1990" role="doc-biblioref">27</a>]</span>. However, MIs have been found without magnetic order <span class="citation" data-cites="law1TTaS2QuantumSpin2017 ribakGaplessExcitationsGround2017"> [<a href="#ref-law1TTaS2QuantumSpin2017" role="doc-biblioref">28</a>,<a href="#ref-ribakGaplessExcitationsGround2017" role="doc-biblioref">29</a>]</span>. Instead, the local moments may form a highly entangled state known as a Quantum Spin Liquid (QSL), which will be discussed shortly.</p>
|
||||
<p>Various theoretical treatments of the Hubbard model have been made, including those based on Fermi liquid theory, mean field treatments, the local density approximation <span class="citation" data-cites="slaterMagneticEffectsHartreeFock1951"> [<a href="#ref-slaterMagneticEffectsHartreeFock1951" role="doc-biblioref">30</a>]</span>, dynamical mean-field theory <span class="citation" data-cites="greinerQuantumPhaseTransition2002"> [<a href="#ref-greinerQuantumPhaseTransition2002" role="doc-biblioref">31</a>]</span>, density matrix renormalisation group methods <span class="citation" data-cites="hallbergNewTrendsDensity2006 schollwöckDensitymatrixRenormalizationGroup2005 whiteDensityMatrixFormulation1992"> [<a href="#ref-hallbergNewTrendsDensity2006" role="doc-biblioref">32</a>–<a href="#ref-whiteDensityMatrixFormulation1992" role="doc-biblioref">34</a>]</span> and Markov chain Monte Carlo <span class="citation" data-cites="blankenbeclerMonteCarloCalculations1981 hirschDiscreteHubbardStratonovichTransformation1983 whiteNumericalStudyTwodimensional1989"> [<a href="#ref-blankenbeclerMonteCarloCalculations1981" role="doc-biblioref">35</a>–<a href="#ref-whiteNumericalStudyTwodimensional1989" role="doc-biblioref">37</a>]</span>. None of these approaches are perfect. Strong correlations are poorly described by Landau-Fermi liquid theory and local density approximation approaches while mean field approximations do poorly in low dimensional systems. This theoretical difficulty has made the Hubbard model a target for cold atom simulations <span class="citation" data-cites="mazurenkoColdatomFermiHubbard2017"> [<a href="#ref-mazurenkoColdatomFermiHubbard2017" role="doc-biblioref">38</a>]</span>.</p>
|
||||
<p>From here, the discussion will branch in two directions. First, I will discuss a limit of the Hubbard model called the Falicov-Kimball model. Second, I will look at QSLs and the Kitaev honeycomb model.</p>
|
||||
<section id="intro-the-fk-model" class="level3">
|
||||
<h3>The Falicov-Kimball Model</h3>
|
||||
@ -117,18 +116,18 @@ image:
|
||||
<img src="/assets/thesis/intro_chapter/fk_schematic.svg" id="fig-fk_schematic" data-short-caption="Falicov-Kimball Model Diagram" style="width:100.0%" alt="Figure 3: The Falicov-Kimball model can be viewed as a model of classical spins S_i coupled to spinless fermions \hat{c}_i where the fermions are mobile with hopping t and the fermions are coupled to the spins by an Ising type interaction with strength U." />
|
||||
<figcaption aria-hidden="true">Figure 3: The Falicov-Kimball model can be viewed as a model of classical spins <span class="math inline">\(S_i\)</span> coupled to spinless fermions <span class="math inline">\(\hat{c}_i\)</span> where the fermions are mobile with hopping <span class="math inline">\(t\)</span> and the fermions are coupled to the spins by an Ising type interaction with strength <span class="math inline">\(U\)</span>.</figcaption>
|
||||
</figure>
|
||||
<p>The Falicov-Kimball (FK) model was originally introduced to describe the metal-insulator transition in f-electron system <span class="citation" data-cites="hubbardj.ElectronCorrelationsNarrow1963 falicovSimpleModelSemiconductorMetal1969"> [<a href="#ref-hubbardj.ElectronCorrelationsNarrow1963" role="doc-biblioref">25</a>,<a href="#ref-falicovSimpleModelSemiconductorMetal1969" role="doc-biblioref">39</a>]</span>. The FK model is the limit of the Hubbard model as the mass of one of the spins states of the electron is taken to infinity. This gives a model with two fermion species, one itinerant and one entirely immobile. The number operators for the immobile fermions are therefore conserved quantities and can be treated like classical degrees of freedom. For our purposes, it will be useful to replace the immobile fermions with a classical Ising background field <span class="math inline">\(S_i = \pm1\)</span>.</p>
|
||||
<p>The Falicov-Kimball (FK) model was originally introduced to describe the metal-insulator transition in f-electron systems <span class="citation" data-cites="hubbardj.ElectronCorrelationsNarrow1963 falicovSimpleModelSemiconductorMetal1969"> [<a href="#ref-hubbardj.ElectronCorrelationsNarrow1963" role="doc-biblioref">25</a>,<a href="#ref-falicovSimpleModelSemiconductorMetal1969" role="doc-biblioref">39</a>]</span>. Shown graphically in fig. <a href="#fig:fk_schematic">3</a>, the FK model is the limit of the Hubbard model as the mass of one of the spin states of the electron is taken to infinity. This gives a model with two fermion species, one itinerant and one entirely immobile. The number operators for the immobile fermions are therefore conserved quantities and can be treated like classical degrees of freedom. For our purposes, it will be useful to replace the immobile fermions with a classical Ising background field <span class="math inline">\(S_i = \pm1\)</span>. At half filing and with this substitution, the Hamiltonian reads</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
H_{\mathrm{FK}} = & -\;t \sum_{\langle i,j \rangle} c^\dagger_{i}c_{j} + \;U \sum_{i} S_i\;(c^\dagger_{i}c_{i} - \tfrac{1}{2}). \\
|
||||
H_{\mathrm{FK}} = & -\;t \sum_{\langle i,j \rangle} c^\dagger_{i}c^{\phantom{\dagger}}_{j} + \;U \sum_{i} S_i\;(c^\dagger_{i}c^{\phantom{\dagger}}_{i} - \tfrac{1}{2}). \\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>The physics of states near the metal-insulator transition is still poorly understood <span class="citation" data-cites="belitzAndersonMottTransition1994 baskoMetalInsulatorTransition2006"> [<a href="#ref-belitzAndersonMottTransition1994" role="doc-biblioref">40</a>,<a href="#ref-baskoMetalInsulatorTransition2006" role="doc-biblioref">41</a>]</span>. As a result, the FK model provides a rich test bed to explore interaction-driven metal-insulator transition physics. Despite its simplicity, the model has a rich phase diagram in <span class="math inline">\(D \geq 2\)</span> dimensions. It shows a Mott insulator transition even at high temperature, similar to the corresponding Hubbard model <span class="citation" data-cites="brandtThermodynamicsCorrelationFunctions1989"> [<a href="#ref-brandtThermodynamicsCorrelationFunctions1989" role="doc-biblioref">42</a>]</span>. In 1D, the ground state phenomenology as a function of filling can be rich <span class="citation" data-cites="gruberGroundStatesSpinless1990"> [<a href="#ref-gruberGroundStatesSpinless1990" role="doc-biblioref">43</a>]</span>, but the system is disordered for all <span class="math inline">\(T > 0\)</span> <span class="citation" data-cites="kennedyItinerantElectronModel1986"> [<a href="#ref-kennedyItinerantElectronModel1986" role="doc-biblioref">44</a>]</span>. The model has also been a test-bed for many-body methods. Interest took off when an exact dynamical mean-field theory solution in the infinite dimensional case was found <span class="citation" data-cites="antipovCriticalExponentsStrongly2014 ribicNonlocalCorrelationsSpectral2016 freericksExactDynamicalMeanfield2003 herrmannNonequilibriumDynamicalCluster2016"> [<a href="#ref-antipovCriticalExponentsStrongly2014" role="doc-biblioref">45</a>–<a href="#ref-herrmannNonequilibriumDynamicalCluster2016" role="doc-biblioref">48</a>]</span>.</p>
|
||||
<p>In <a href="../3_Long_Range_Falicov_Kimball/3.1_LRFK_Model.html">chapter 3</a>, I will introduce a generalised Falicov-Kimball model in 1D I call the Long-Range Falicov-Kimball model. With the addition of long-range interactions in the background field, the model shows a similarly rich phase diagram like its higher dimensional cousins. Our goal is to understand the Mott transition in more detail, the phase transition into a charge density wave state and how the localisation properties of the fermionic sector behave in 1D. We were particularly interested to see if correlations in the disorder potential are enough to bring about localisation effects, such as mobility edges, that are normally only seen in higher dimensions. I use an exact Markov chain Monte Carlo method to map the phase diagram and compute the energy-resolved localisation properties of the fermions. We observe what appears to be a hint of coexisting localised and delocalised states. However, after careful comparison to an Anderson model of uncorrelated binary disorder about a background charge density wave field, we confirm that the fermionic sector does fully localise at larger system sizes as expected for 1D systems.</p>
|
||||
<p>In <a href="../3_Long_Range_Falicov_Kimball/3.1_LRFK_Model.html">chapter 3</a>, I will introduce a generalised Falicov-Kimball model in 1D I call the Long-Range Falicov-Kimball model. With the addition of long-range interactions in the background field, the model shows a rich phase diagram like its higher dimensional cousins. Our goal is to understand the Mott transition in more detail, the phase transition into a charge density wave state and how the localisation properties of the fermionic sector behave in 1D. I was particularly interested to see if correlations in the disorder potential are enough to bring about localisation effects, such as mobility edges, that are normally only seen in higher dimensions. I use an exact Markov chain Monte Carlo method to map the phase diagram and compute the energy-resolved localisation properties of the fermions. We observe what appears to be a hint of coexisting localised and delocalised states. However, after careful comparison to an Anderson model of uncorrelated binary disorder about a background charge density wave field, we confirm that the fermionic sector does fully localise at larger system sizes as expected for 1D systems.</p>
|
||||
</section>
|
||||
</section>
|
||||
<section id="quantum-spin-liquids" class="level1">
|
||||
<h1>Quantum Spin Liquids</h1>
|
||||
<p>Turning to the other key topic of this thesis, we have already discussed the AFM ordering of local moments in the Mott insulating state. Landau-Ginzburg-Wilson theory characterises phases of matter as inextricably linked to the emergence of long-range order via a spontaneously broken symmetry. Within this paradigm, we would not expect any interesting phases of matter not associated with AFM or other long-range order. However, Anderson first proposed in 1973 <span class="citation" data-cites="Anderson1973"> [<a href="#ref-Anderson1973" role="doc-biblioref">49</a>]</span> that, if long-range order is suppressed by some mechanism, it might lead to a liquid-like state even at zero temperature: a QSL.</p>
|
||||
<p>This QSL state would exist at zero or very low temperatures. Therefore, we would expect quantum effects to be very strong, which will have far reaching consequences. It was the discovery of a different phase, however, that really kickstarted interest in the topic. The fractional quantum Hall state, discovered in the 1980s <span class="citation" data-cites="laughlinAnomalousQuantumHall1983"> [<a href="#ref-laughlinAnomalousQuantumHall1983" role="doc-biblioref">50</a>]</span> is an explicit example of an interacting electron system that falls outside of the Landau-Ginzburg-Wilson paradigm<a href="#fn1" class="footnote-ref" id="fnref1" role="doc-noteref"><sup>1</sup></a>. It shares many phenomenological properties with the QSL state. They both exhibit fractionalised excitations, braiding statistics and non-trivial topological properties <span class="citation" data-cites="broholmQuantumSpinLiquids2020"> [<a href="#ref-broholmQuantumSpinLiquids2020" role="doc-biblioref">55</a>]</span>. The many-body ground state of such systems acts as a complex and highly entangled vacuum. This vacuum can support quasiparticle excitations with properties unbound from that of the Dirac fermions of the standard model.</p>
|
||||
<p>This QSL state would exist at zero or very low temperatures. Therefore, we would expect quantum effects to be very strong, which will have far reaching consequences. It was the discovery of a different phase, however, that really kickstarted interest in the topic. The fractional quantum Hall state, discovered in the 1980s <span class="citation" data-cites="laughlinAnomalousQuantumHall1983"> [<a href="#ref-laughlinAnomalousQuantumHall1983" role="doc-biblioref">50</a>]</span> is an explicit example of an interacting electron system that falls outside of the Landau-Ginzburg-Wilson paradigm<a href="#fn2" class="footnote-ref" id="fnref2" role="doc-noteref"><sup>2</sup></a>. It shares many phenomenological properties with the QSL state. They both exhibit fractionalised excitations, braiding statistics and non-trivial topological properties <span class="citation" data-cites="broholmQuantumSpinLiquids2020"> [<a href="#ref-broholmQuantumSpinLiquids2020" role="doc-biblioref">55</a>]</span>. The many-body ground state of such systems acts as a complex and highly entangled vacuum. This vacuum can support quasiparticle excitations with properties unbound from that of the Dirac fermions of the standard model.</p>
|
||||
<p>How do we actually make a QSL? Frustration is one mechanism that we can use to suppress magnetic order in spin models <span class="citation" data-cites="TrebstPhysRep2022"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">56</a>]</span>. Frustration can be geometric. Triangular lattices, for instance, cannot support AFM order. It can also come about as a result of spin-orbit coupling or other physics. There are also other routes to QSLs besides frustrated spin systems that we will not discuss here <span class="citation" data-cites="balentsNodalLiquidTheory1998 balentsDualOrderParameter1999 linExactSymmetryWeaklyinteracting1998"> [<a href="#ref-balentsNodalLiquidTheory1998" role="doc-biblioref">57</a>–<a href="#ref-linExactSymmetryWeaklyinteracting1998" role="doc-biblioref">59</a>]</span>.</p>
|
||||
<!-- Experimentally, Mott insulating systems without magnetic order have been proposed as QSL systems\ [@law1TTaS2QuantumSpin2017; @ribakGaplessExcitationsGround2017]. -->
|
||||
<!-- Other exampels: Quantum spin liquids are the analogous phase of matter for spin systems. Spin ice support deconfined magnetic monopoles. -->
|
||||
@ -137,14 +136,14 @@ H_{\mathrm{FK}} = & -\;t \sum_{\langle i,j \rangle} c^\dagger_{i}c_{j} + \;U
|
||||
<figcaption aria-hidden="true">Figure 4: How Kitaev materials fit into the picture of strongly correlated systems. Interactions are required to open a Mott gap and localise the electrons into local moments, while spin-orbit correlations are required to produce the strongly anisotropic spin-spin couplings of the Kitaev model. Reproduced from <span class="citation" data-cites="TrebstPhysRep2022"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">56</a>]</span>.</figcaption>
|
||||
</figure>
|
||||
<p>Spin-orbit coupling is a relativistic effect that, very roughly, corresponds to the fact that in the frame of reference of a moving electron the electric field of nearby nuclei looks like a magnetic field to which the electron spin couples. This couples the spatial and spin parts of the electron wavefunction. The lattice structure can therefore influence the form of the spin-spin interactions, leading to spatial anisotropy in the effective interactions. This spatial anisotropy can frustrate an MI state <span class="citation" data-cites="jackeliMottInsulatorsStrong2009 khaliullinOrbitalOrderFluctuations2005"> [<a href="#ref-jackeliMottInsulatorsStrong2009" role="doc-biblioref">60</a>,<a href="#ref-khaliullinOrbitalOrderFluctuations2005" role="doc-biblioref">61</a>]</span> leading to more exotic ground states than the AFM order we have seen so far. As with the Hubbard model, interaction effects are only strong or weak in comparison to the bandwidth or hopping integral <span class="math inline">\(t\)</span>. Hence, we will see strong frustration in materials with strong spin-orbit coupling <span class="math inline">\(\lambda\)</span> relative to their bandwidth <span class="math inline">\(t\)</span>.</p>
|
||||
<p>In certain transition metal based compounds, such as those based on Iridium and Ruthenium, the lattice structure, strong spin-orbit coupling and narrow bandwidths lead to effective spin-<span class="math inline">\(\tfrac{1}{2}\)</span> Mott insulating states with strongly anisotropic spin-spin couplings. These transition metal compounds, known as Kitaev materials, draw their name from the celebrated Kitaev Honeycomb (KH) model which is expected to model their low temperature behaviour <span class="citation" data-cites="Jackeli2009 HerrmannsAnRev2018 Winter2017 TrebstPhysRep2022 Takagi2019"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">56</a>,<a href="#ref-Jackeli2009" role="doc-biblioref">62</a>–<a href="#ref-Takagi2019" role="doc-biblioref">65</a>]</span>.</p>
|
||||
<p>At this point, we can sketch out a phase diagram like that of fig. <a href="#fig:kitaev-material-phase-diagram">4</a>. When both electron-electron interactions <span class="math inline">\(U\)</span> and spin-orbit couplings <span class="math inline">\(\lambda\)</span> are small relative to the bandwidth <span class="math inline">\(t\)</span>, we recover standard band theory of band insulators and metals. In the upper left, we have the simple Mott insulating state as described by the Hubbard model. In the lower right, strong spin-orbit coupling gives rise to topological insulators characterised by symmetry protected edge modes and non-zero Chern number. Kitaev materials occur in the region where strong electron-electron interaction and spin-orbit coupling interact. See <span class="citation" data-cites="witczak-krempaCorrelatedQuantumPhenomena2014"> [<a href="#ref-witczak-krempaCorrelatedQuantumPhenomena2014" role="doc-biblioref">66</a>]</span> for a more expansive version of this diagram.</p>
|
||||
<p>The KH model <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">67</a>]</span> was one of the first exactly solvable spin models with a QSL ground state. It is defined on the 2D honeycomb lattice and provides an exactly solvable model that can be reduced to a free fermion problem via a mapping to Majorana fermions. This yields an extensive number of static <span class="math inline">\(\mathbb Z_2\)</span> fluxes tied to an emergent gauge field. The model is remarkable not only for its QSL ground state, but also for its fractionalised excitations with non-trivial braiding statistics. It has a rich phase diagram hosting gapless, Abelian and non-Abelian phases <span class="citation" data-cites="knolleDynamicsFractionalizationQuantum2015"> [<a href="#ref-knolleDynamicsFractionalizationQuantum2015" role="doc-biblioref">68</a>]</span> and a finite temperature phase transition to a thermal metal state <span class="citation" data-cites="selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">69</a>]</span>. It has been proposed that its non-Abelian excitations could be used to support robust topological quantum computing <span class="citation" data-cites="kitaev_fault-tolerant_2003 freedmanTopologicalQuantumComputation2003 nayakNonAbelianAnyonsTopological2008"> [<a href="#ref-kitaev_fault-tolerant_2003" role="doc-biblioref">70</a>–<a href="#ref-nayakNonAbelianAnyonsTopological2008" role="doc-biblioref">72</a>]</span>.</p>
|
||||
<p>In certain transition metal based compounds, such as those based on iridium and ruthenium, the lattice structure, strong spin-orbit coupling and narrow bandwidths lead to effective spin-<span class="math inline">\(\tfrac{1}{2}\)</span> Mott insulating states with strongly anisotropic spin-spin couplings. These transition metal compounds, known as Kitaev materials, draw their name from the celebrated Kitaev Honeycomb (KH) model which is expected to model their low temperature behaviour <span class="citation" data-cites="Jackeli2009 HerrmannsAnRev2018 Winter2017 TrebstPhysRep2022 Takagi2019"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">56</a>,<a href="#ref-Jackeli2009" role="doc-biblioref">62</a>–<a href="#ref-Takagi2019" role="doc-biblioref">65</a>]</span>.</p>
|
||||
<p>At this point, we can sketch out a phase diagram like that of fig. <a href="#fig:kitaev-material-phase-diagram">4</a>. When both electron-electron interactions <span class="math inline">\(U\)</span> and spin-orbit couplings <span class="math inline">\(\lambda\)</span> are small relative to the bandwidth <span class="math inline">\(t\)</span>, we recover standard band theory of band insulators and metals. In the upper left, we have the simple Mott insulating state as described by the Hubbard model. In the lower right, strong spin-orbit coupling gives rise to topological insulators characterised by symmetry protected edge modes and non-zero Chern number. Kitaev materials occur in the region where strong electron-electron interaction and spin-orbit coupling interact. See ref. <span class="citation" data-cites="witczak-krempaCorrelatedQuantumPhenomena2014"> [<a href="#ref-witczak-krempaCorrelatedQuantumPhenomena2014" role="doc-biblioref">66</a>]</span> for a more expansive version of this diagram.</p>
|
||||
<p>The KH model <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">67</a>]</span> was one of the early exactly solvable spin models with a QSL ground state. It is defined on the 2D honeycomb lattice and provides an exactly solvable model that can be reduced to a free fermion problem via a mapping to Majorana fermions. This yields an extensive number of static <span class="math inline">\(\mathbb Z_2\)</span> fluxes tied to an emergent gauge field. The model is remarkable not only for its QSL ground state, but also for its fractionalised excitations with non-trivial braiding statistics. It has a rich phase diagram hosting gapless, Abelian and non-Abelian phases <span class="citation" data-cites="knolleDynamicsFractionalizationQuantum2015"> [<a href="#ref-knolleDynamicsFractionalizationQuantum2015" role="doc-biblioref">68</a>]</span> and a finite temperature phase transition to a thermal metal state <span class="citation" data-cites="selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">69</a>]</span>. It has been proposed that its non-Abelian excitations could be used to support robust topological quantum computing <span class="citation" data-cites="kitaev_fault-tolerant_2003 freedmanTopologicalQuantumComputation2003 nayakNonAbelianAnyonsTopological2008"> [<a href="#ref-kitaev_fault-tolerant_2003" role="doc-biblioref">70</a>–<a href="#ref-nayakNonAbelianAnyonsTopological2008" role="doc-biblioref">72</a>]</span>.</p>
|
||||
<p>The KH and FK models have quite a bit of conceptual overlap. They can both be seen as models of spinless fermions coupled to a classical Ising background field. This is what makes them exactly solvable. At finite temperatures, fluctuations in their background fields provide an effective disorder potential for the fermionic sector, so both models can be studied at finite temperature with Markov chain Monte Carlo methods <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016 selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">69</a>,<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">73</a>]</span>.</p>
|
||||
<p>As Kitaev points out in his original paper, the KH model remains solvable on any trivalent <span class="math inline">\(z=3\)</span> graph which can be three-edge-coloured. Indeed, many generalisations of the model exist <span class="citation" data-cites="Baskaran2007 Baskaran2008 Nussinov2009 OBrienPRB2016 hermanns2015weyl"> [<a href="#ref-Baskaran2007" role="doc-biblioref">74</a>–<a href="#ref-hermanns2015weyl" role="doc-biblioref">78</a>]</span>. Notably, the Yao-Kivelson model <span class="citation" data-cites="yaoExactChiralSpin2007"> [<a href="#ref-yaoExactChiralSpin2007" role="doc-biblioref">79</a>]</span> introduces triangular plaquettes to the honeycomb lattice leading to spontaneous chiral symmetry breaking. These extensions all retain translation symmetry. This is likely because edge-colouring, finding the ground state and understanding the QSL properties are much harder without it <span class="citation" data-cites="eschmann2019thermodynamics Peri2020"> [<a href="#ref-eschmann2019thermodynamics" role="doc-biblioref">80</a>,<a href="#ref-Peri2020" role="doc-biblioref">81</a>]</span>. Undeterred, this gap led us to wonder what might happen if we remove translation symmetry from the Kitaev model. This would be a model of a trivalent, highly bond anisotropic but otherwise amorphous material.</p>
|
||||
<p>Amorphous materials do not have long-range lattice regularities but may have short-range regularities in the lattice structure, such as fixed coordination number <span class="math inline">\(z\)</span> as in some covalent compounds. The best examples are amorphous Silicon and Germanium with <span class="math inline">\(z=4\)</span> which are used to make thin-film solar cells <span class="citation" data-cites="Weaire1971 betteridge1973possible"> [<a href="#ref-Weaire1971" role="doc-biblioref">82</a>,<a href="#ref-betteridge1973possible" role="doc-biblioref">83</a>]</span>. Recently, it has been shown that topological insulating (TI) phases can exist in amorphous systems. Amorphous TIs are characterised by similar protected edge states to their translation invariant cousins and generalised topological bulk invariants <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 agarwala2019topological marsalTopologicalWeaireThorpeModels2020 costa2019toward agarwala2020higher spring2021amorphous corbae2019evidence"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">84</a>–<a href="#ref-corbae2019evidence" role="doc-biblioref">90</a>]</span>. However, research on amorphous electronic systems has mostly focused on non-interacting systems with a few exceptions, for example, to account for the observation of superconductivity <span class="citation" data-cites="buckel1954einfluss mcmillan1981electron meisel1981eliashberg bergmann1976amorphous mannaNoncrystallineTopologicalSuperconductors2022"> [<a href="#ref-buckel1954einfluss" role="doc-biblioref">91</a>–<a href="#ref-mannaNoncrystallineTopologicalSuperconductors2022" role="doc-biblioref">95</a>]</span> in amorphous materials or very recently to understand the effect of strong electron repulsion in TIs <span class="citation" data-cites="kim2022fractionalization"> [<a href="#ref-kim2022fractionalization" role="doc-biblioref">96</a>]</span>.</p>
|
||||
<p>Amorphous materials do not have long-range lattice regularities but may have short-range regularities in the lattice structure, such as fixed coordination number <span class="math inline">\(z\)</span> as in some covalent compounds. The best examples are amorphous silicon and germanium with <span class="math inline">\(z=4\)</span> which are used to make thin-film solar cells <span class="citation" data-cites="Weaire1971 betteridge1973possible"> [<a href="#ref-Weaire1971" role="doc-biblioref">82</a>,<a href="#ref-betteridge1973possible" role="doc-biblioref">83</a>]</span>. Recently, it has been shown that topological insulating (TI) phases can exist in amorphous systems. Amorphous TIs are characterised by similar protected edge states to their translation invariant cousins and generalised topological bulk invariants <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 agarwala2019topological marsalTopologicalWeaireThorpeModels2020 costa2019toward agarwala2020higher spring2021amorphous corbae2019evidence"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">84</a>–<a href="#ref-corbae2019evidence" role="doc-biblioref">90</a>]</span>. However, research on amorphous electronic systems has mostly focused on non-interacting systems with a few exceptions, for example, to account for the observation of superconductivity <span class="citation" data-cites="buckel1954einfluss mcmillan1981electron meisel1981eliashberg bergmann1976amorphous mannaNoncrystallineTopologicalSuperconductors2022"> [<a href="#ref-buckel1954einfluss" role="doc-biblioref">91</a>–<a href="#ref-mannaNoncrystallineTopologicalSuperconductors2022" role="doc-biblioref">95</a>]</span> in amorphous materials or very recently to understand the effect of strong electron repulsion in TIs <span class="citation" data-cites="kim2022fractionalization"> [<a href="#ref-kim2022fractionalization" role="doc-biblioref">96</a>]</span>.</p>
|
||||
<p>Amorphous <em>magnetic</em> systems have been investigated since the 1960s, mostly through the adaptation of theoretical tools developed for disordered systems <span class="citation" data-cites="aharony1975critical Petrakovski1981 kaneyoshi1992introduction Kaneyoshi2018"> [<a href="#ref-aharony1975critical" role="doc-biblioref">97</a>–<a href="#ref-Kaneyoshi2018" role="doc-biblioref">100</a>]</span> and with numerical methods <span class="citation" data-cites="fahnle1984monte plascak2000ising"> [<a href="#ref-fahnle1984monte" role="doc-biblioref">101</a>,<a href="#ref-plascak2000ising" role="doc-biblioref">102</a>]</span>. Research on classical Heisenberg and Ising models accounts for the observed behaviour of ferromagnetism, disordered antiferromagnetism and widely observed spin glass behaviour <span class="citation" data-cites="coey1978amorphous"> [<a href="#ref-coey1978amorphous" role="doc-biblioref">103</a>]</span>. However, the role of spin-anisotropic interactions and quantum effects in amorphous magnets has not been addressed.</p>
|
||||
<p>In <a href="../4_Amorphous_Kitaev_Model/4.1_AMK_Model.html">chapter 4</a>, I will address the question of whether frustrated magnetic interactions on amorphous lattices can give rise to genuine quantum phases, i.e. to long-range entangled QSL <span class="citation" data-cites="Anderson1973 Knolle2019 Savary2016 Lacroix2011"> [<a href="#ref-Anderson1973" role="doc-biblioref">49</a>,<a href="#ref-Knolle2019" role="doc-biblioref">104</a>–<a href="#ref-Lacroix2011" role="doc-biblioref">106</a>]</span>. We will find that the answer is yes. I will introduce the amorphous Kitaev model, a generalisation of the KH model to random lattices with fixed coordination number three. I will show that this model is a solvable, amorphous, chiral spin liquid. As with the Yao-Kivelson model <span class="citation" data-cites="yaoExactChiralSpin2007"> [<a href="#ref-yaoExactChiralSpin2007" role="doc-biblioref">79</a>]</span>, the amorphous Kitaev model retains its exact solubility but the presence of plaquettes with an odd number of sides leads to a spontaneous breaking of time reversal symmetry. I will confirm prior observations that the form of the ground state is relatively simple <span class="citation" data-cites="OBrienPRB2016 eschmannThermodynamicClassificationThreedimensional2020"> [<a href="#ref-OBrienPRB2016" role="doc-biblioref">77</a>,<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">107</a>]</span> and unearth a rich phase diagram displaying Abelian as well as a non-Abelian chiral spin liquid phases. Furthermore, I will show that the system undergoes a finite-temperature phase transition to a thermal metal state and discuss possible experimental realisations.</p>
|
||||
<p>In <a href="../4_Amorphous_Kitaev_Model/4.1_AMK_Model.html">chapter 4</a>, I will address the question of whether frustrated magnetic interactions on amorphous lattices can give rise to genuine quantum phases, i.e., to long-range entangled QSL <span class="citation" data-cites="Anderson1973 Knolle2019 Savary2016 Lacroix2011"> [<a href="#ref-Anderson1973" role="doc-biblioref">49</a>,<a href="#ref-Knolle2019" role="doc-biblioref">104</a>–<a href="#ref-Lacroix2011" role="doc-biblioref">106</a>]</span>. We will find that the answer is yes. I will introduce the amorphous Kitaev model, a generalisation of the KH model to random lattices with fixed coordination number three. I will show that this model is a solvable, amorphous, chiral spin liquid. As with the Yao-Kivelson model <span class="citation" data-cites="yaoExactChiralSpin2007"> [<a href="#ref-yaoExactChiralSpin2007" role="doc-biblioref">79</a>]</span>, the amorphous Kitaev model retains its exact solubility but the presence of plaquettes with an odd number of sides leads to a spontaneous breaking of time reversal symmetry. I will confirm prior observations that the form of the ground state is relatively simple <span class="citation" data-cites="OBrienPRB2016 eschmannThermodynamicClassificationThreedimensional2020"> [<a href="#ref-OBrienPRB2016" role="doc-biblioref">77</a>,<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">107</a>]</span> and unearth a rich phase diagram displaying Abelian as well as a non-Abelian chiral spin liquid phases. Furthermore, I will show that the system undergoes a finite-temperature phase transition to a thermal metal state and discuss possible experimental realisations.</p>
|
||||
<p>The next chapter, <a href="../2_Background/2.1_FK_Model.html">Chapter 2</a>, will introduce some necessary background to the FK model, the KH model, and disorder and localisation.</p>
|
||||
<p>Next Chapter: <a href="../2_Background/2.1_FK_Model.html">2 Background</a></p>
|
||||
</section>
|
||||
@ -477,7 +476,8 @@ H_{\mathrm{FK}} = & -\;t \sum_{\langle i,j \rangle} c^\dagger_{i}c_{j} + \;U
|
||||
<section class="footnotes footnotes-end-of-document" role="doc-endnotes">
|
||||
<hr />
|
||||
<ol>
|
||||
<li id="fn1" role="doc-endnote"><p>In fact the FQH state <em>can</em> be described within a Ginzburg-Landau like paradigm but it requires a non-local order parameter <span class="citation" data-cites="girvinOffdiagonalLongrangeOrder1987 readOrderParameterGinzburgLandau1989"> [<a href="#ref-girvinOffdiagonalLongrangeOrder1987" role="doc-biblioref">51</a>,<a href="#ref-readOrderParameterGinzburgLandau1989" role="doc-biblioref">52</a>]</span>. Phases like this are said to possess <em>topological order</em> defined by the fact that they cannot be smoothly deformed into a product state <span class="citation" data-cites="chenLocalUnitaryTransformation2010 wenQuantumOrdersSymmetric2002"> [<a href="#ref-chenLocalUnitaryTransformation2010" role="doc-biblioref">53</a>,<a href="#ref-wenQuantumOrdersSymmetric2002" role="doc-biblioref">54</a>]</span>.<a href="#fnref1" class="footnote-back" role="doc-backlink">↩︎</a></p></li>
|
||||
<li id="fn1" role="doc-endnote"><p>The Pauli exclusion principle is special in that it can be treated much more simply that other interactions, this relates to the fact that it is a hard constraint.<a href="#fnref1" class="footnote-back" role="doc-backlink">↩︎</a></p></li>
|
||||
<li id="fn2" role="doc-endnote"><p>In fact the FQH state <em>can</em> be described within a Ginzburg-Landau like paradigm but it requires a non-local order parameter <span class="citation" data-cites="girvinOffdiagonalLongrangeOrder1987 readOrderParameterGinzburgLandau1989"> [<a href="#ref-girvinOffdiagonalLongrangeOrder1987" role="doc-biblioref">51</a>,<a href="#ref-readOrderParameterGinzburgLandau1989" role="doc-biblioref">52</a>]</span>. Phases like this are said to possess <em>topological order</em> defined by the fact that they cannot be smoothly deformed into a product state <span class="citation" data-cites="chenLocalUnitaryTransformation2010 wenQuantumOrdersSymmetric2002"> [<a href="#ref-chenLocalUnitaryTransformation2010" role="doc-biblioref">53</a>,<a href="#ref-wenQuantumOrdersSymmetric2002" role="doc-biblioref">54</a>]</span>.<a href="#fnref2" class="footnote-back" role="doc-backlink">↩︎</a></p></li>
|
||||
</ol>
|
||||
</section>
|
||||
|
||||
|
@ -83,32 +83,32 @@ image:
|
||||
<section id="the-model" class="level2">
|
||||
<h2>The Model</h2>
|
||||
<figure>
|
||||
<img src="/assets/thesis/background_chapter/simple_DOS.svg" id="fig-simple_DOS" data-short-caption="Cubic Lattice dispersion with disorder" style="width:100.0%" alt="Figure 1: The dispersion (upper row) and density of states (lower row) obtained from a cubic lattice model H = \sum_{i} V_i c^\dagger_{i}c_{i} - t \sum_{\langle i,j\rangle} c^\dagger_{i}c_{j} in 1D. (a) With no external potential. (b) With a static charge density wave background V_i = (-1)^i (c) A static charge density wave background with 2% binary disorder. The top row shows the analytic dispersion in orange compared with the integral of the DOS in dotted black." />
|
||||
<figcaption aria-hidden="true">Figure 1: The dispersion (upper row) and density of states (lower row) obtained from a cubic lattice model <span class="math inline">\(H = \sum_{i} V_i c^\dagger_{i}c_{i} - t \sum_{\langle i,j\rangle} c^\dagger_{i}c_{j}\)</span> in 1D. (a) With no external potential. (b) With a static charge density wave background <span class="math inline">\(V_i = (-1)^i\)</span> (c) A static charge density wave background with 2% binary disorder. The top row shows the analytic dispersion in orange compared with the integral of the DOS in dotted black.</figcaption>
|
||||
<img src="/assets/thesis/background_chapter/simple_DOS.svg" id="fig-simple_DOS" data-short-caption="Cubic Lattice dispersion with disorder" style="width:100.0%" alt="Figure 1: The dispersion (upper row) and density of states (lower row) obtained from a cubic lattice model H = \sum_{i} V_i c^\dagger_{i}c^{\phantom{\dagger}}_{i} - t \sum_{\langle i,j\rangle} c^\dagger_{i}c^{\phantom{\dagger}}_{j} in 1D. (a) With no external potential. (b) With a static charge density wave background V_i = (-1)^i (c) A static charge density wave background with 2% binary disorder. The top row shows the analytic dispersion in orange compared with the integral of the DOS in dotted black." />
|
||||
<figcaption aria-hidden="true">Figure 1: The dispersion (upper row) and density of states (lower row) obtained from a cubic lattice model <span class="math inline">\(H = \sum_{i} V_i c^\dagger_{i}c^{\phantom{\dagger}}_{i} - t \sum_{\langle i,j\rangle} c^\dagger_{i}c^{\phantom{\dagger}}_{j}\)</span> in 1D. (a) With no external potential. (b) With a static charge density wave background <span class="math inline">\(V_i = (-1)^i\)</span> (c) A static charge density wave background with 2% binary disorder. The top row shows the analytic dispersion in orange compared with the integral of the DOS in dotted black.</figcaption>
|
||||
</figure>
|
||||
<p>The Falicov-Kimball (FK) model is one of the simplest models of the correlated electron problem. It captures the essence of the interaction between itinerant and localised electrons. It was originally introduced to explain the metal-insulator transition in f-electron systems. However, in its long history, the FK model has been interpreted variously as a model of electrons and ions, binary alloys or crystal formation <span class="citation" data-cites="hubbardj.ElectronCorrelationsNarrow1963 falicovSimpleModelSemiconductorMetal1969 gruberFalicovKimballModelReview1996 gruberFalicovKimballModel2006"> [<a href="#ref-hubbardj.ElectronCorrelationsNarrow1963" role="doc-biblioref">1</a>–<a href="#ref-gruberFalicovKimballModel2006" role="doc-biblioref">4</a>]</span>. In terms of immobile fermions <span class="math inline">\(d_i\)</span> and light fermions <span class="math inline">\(c_i\)</span> and with chemical potential fixed at half-filling, the model reads</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
H_{\mathrm{FK}} = & \;U \sum_{i} (d^\dagger_{i}d_{i} - \tfrac{1}{2})\;(c^\dagger_{i}c_{i} - \tfrac{1}{2}) -\;t \sum_{\langle i,j\rangle} c^\dagger_{i}c_{j}.\\
|
||||
H_{\mathrm{FK}} = & \;U \sum_{i} (d^\dagger_{i}d_{i} - \tfrac{1}{2})\;(c^\dagger_{i}c^{\phantom{\dagger}}_{i} - \tfrac{1}{2}) -\;t \sum_{\langle i,j\rangle} c^\dagger_{i}c^{\phantom{\dagger}}_{j}.\\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Here we will only discuss the hypercubic lattices, i.e the chain, the square lattice, the cubic lattice and so on. The connection to the Hubbard model is that we have relabelled the up and down spin electron states and removed the hopping term for one spin state. This is equivalent to taking the limit of infinite mass ratio <span class="citation" data-cites="devriesSimplifiedHubbardModel1993"> [<a href="#ref-devriesSimplifiedHubbardModel1993" role="doc-biblioref">5</a>]</span>.</p>
|
||||
<p>Like other exactly solvable models <span class="citation" data-cites="smithDisorderFreeLocalization2017"> [<a href="#ref-smithDisorderFreeLocalization2017" role="doc-biblioref">6</a>]</span>, the FK model possesses extensively many conserved degrees of freedom <span class="math inline">\([d^\dagger_{i}d_{i}, H] = 0\)</span>. Similarly, the Kitaev model contains an extensive number of conserved fluxes. So in both models, the Hilbert space breaks up into a set of sectors in which these operators take a definite value. Crucially, this reduces the interaction terms in the model from being quartic in fermion operators to quadratic. This is what makes the two models exactly solvable, in contrast to the Hubbard model. For the FK model the interaction term <span class="math inline">\((d^\dagger_{i}d_{i} - \tfrac{1}{2})\;(c^\dagger_{i}c_{i} - \tfrac{1}{2})\)</span> becomes quadratic when <span class="math inline">\(d^\dagger_{i}d_{i}\)</span> is replaced with one of its eigenvalues <span class="math inline">\(\{0,1\}\)</span>. The same thing happens in the Kitaev model, though after first applying a clever transformation which we will discuss later.</p>
|
||||
<p>Due to Pauli exclusion, maximum filling occurs when each lattice site is fully occupied, <span class="math inline">\(\langle n_c + n_d \rangle = 2\)</span>. Here we will focus on the half filled case <span class="math inline">\(\langle n_c + n_d \rangle = 1\)</span>. The ground state phenomenology as the model is doped away from the half-filled state can be rich <span class="citation" data-cites="jedrzejewskiFalicovKimballModels2001 gruberGroundStatesSpinless1990"> [<a href="#ref-jedrzejewskiFalicovKimballModels2001" role="doc-biblioref">7</a>,<a href="#ref-gruberGroundStatesSpinless1990" role="doc-biblioref">8</a>]</span> but the half-filled point has symmetries that make it particularly interesting. From this point on, we will only consider the half-filled point.</p>
|
||||
<p>At half-filling and on bipartite lattices, the FK model is particle-hole symmetric. That is, the Hamiltonian anticommutes with the particle hole operator <span class="math inline">\(\mathcal{P}H\mathcal{P}^{-1} = -H\)</span>. As a consequence, the energy spectrum is symmetric about <span class="math inline">\(E = 0\)</span>, which is the Fermi energy. The particle hole operator corresponds to the substitution <span class="math inline">\(c^\dagger_i \rightarrow \epsilon_i c_i, d^\dagger_i \rightarrow d_i\)</span> where <span class="math inline">\(\epsilon_i = +1\)</span> for the A sublattice and <span class="math inline">\(-1\)</span> for the B sublattice <span class="citation" data-cites="gruberFalicovKimballModel2006"> [<a href="#ref-gruberFalicovKimballModel2006" role="doc-biblioref">4</a>]</span>. The absence of a hopping term for the heavy electrons means they do not need the factor of <span class="math inline">\(\epsilon_i\)</span> but they would need it in the corresponding Hubbard model. See appendix <a href="../6_Appendices/A.1_Particle_Hole_Symmetry.html#particle-hole-symmetry">A.1</a> for a full derivation of the particle-hole symmetry.</p>
|
||||
<p>We will later add a long-range interaction between the localised electrons. At that point we will replace the immobile fermions with a classical Ising field <span class="math inline">\(S_i = 1 - 2d^\dagger_id_i = \pm1\)</span> which I will refer to as the spins.</p>
|
||||
<p>Here we will only discuss the hypercubic lattices, i.e., the chain, the square lattice, the cubic lattice and so on. The connection to the Hubbard model is that we have relabelled the up and down spin electron states and removed the hopping term for one spin state. This is equivalent to taking the limit of infinite mass ratio <span class="citation" data-cites="devriesSimplifiedHubbardModel1993"> [<a href="#ref-devriesSimplifiedHubbardModel1993" role="doc-biblioref">5</a>]</span>.</p>
|
||||
<p>Like other exactly solvable models <span class="citation" data-cites="smithDisorderFreeLocalization2017"> [<a href="#ref-smithDisorderFreeLocalization2017" role="doc-biblioref">6</a>]</span>, the FK model possesses extensively many conserved degrees of freedom <span class="math inline">\([d^\dagger_{i}d_{i}, H] = 0\)</span>. Similarly, the Kitaev model contains an extensive number of conserved fluxes. So in both models, the Hilbert space breaks up into a set of sectors in which these operators take a definite value. Crucially, this reduces the interaction terms in the model from being quartic in fermion operators to quadratic. This is what makes the two models exactly solvable, in contrast to the Hubbard model. For the FK model the interaction term <span class="math inline">\((d^\dagger_{i}d_{i} - \tfrac{1}{2})\;(c^\dagger_{i}c^{\phantom{\dagger}}_{i} - \tfrac{1}{2})\)</span> becomes quadratic when <span class="math inline">\(d^\dagger_{i}d_{i}\)</span> is replaced with one of its eigenvalues <span class="math inline">\(\{0,1\}\)</span>. The same thing happens in the Kitaev model, though after first applying a clever transformation which we will discuss later.</p>
|
||||
<p>Due to Pauli exclusion, maximum filling occurs when each lattice site is fully occupied, <span class="math inline">\(\langle n_c + n_d \rangle = 2\)</span>. Here we will focus on the half-filled case <span class="math inline">\(\langle n_c + n_d \rangle = 1\)</span>. The ground state phenomenology as the model is doped away from the half-filled state can be rich <span class="citation" data-cites="jedrzejewskiFalicovKimballModels2001 gruberGroundStatesSpinless1990"> [<a href="#ref-jedrzejewskiFalicovKimballModels2001" role="doc-biblioref">7</a>,<a href="#ref-gruberGroundStatesSpinless1990" role="doc-biblioref">8</a>]</span> but the half-filled point has symmetries that make it particularly interesting. From this point on, we will only consider the half-filled point.</p>
|
||||
<p>At half-filling and on bipartite lattices, the FK model is particle-hole symmetric. That is, the Hamiltonian anticommutes with the particle hole operator <span class="math inline">\(\mathcal{P}H\mathcal{P}^{-1} = -H\)</span>. As a consequence, the energy spectrum is symmetric about <span class="math inline">\(E = 0\)</span>, which is the Fermi energy. Fig. <a href="#fig:simple_DOS">1</a> shows this in action, in the presence of a periodic potential a gap in the energy spectrum opens symmetrically about <span class="math inline">\(E = 0\)</span>. The particle hole operator corresponds to the substitution <span class="math inline">\(c^\dagger_i \rightarrow \epsilon_i c_i, d^\dagger_i \rightarrow d_i\)</span> where <span class="math inline">\(\epsilon_i = +1\)</span> for the A sublattice and <span class="math inline">\(-1\)</span> for the B sublattice <span class="citation" data-cites="gruberFalicovKimballModel2006"> [<a href="#ref-gruberFalicovKimballModel2006" role="doc-biblioref">4</a>]</span>. The absence of a hopping term for the heavy electrons means they do not need the factor of <span class="math inline">\(\epsilon_i\)</span> but they would need it in the corresponding Hubbard model. See appendix <a href="../6_Appendices/A.1_Particle_Hole_Symmetry.html#particle-hole-symmetry">A.1</a> for a full derivation of the particle-hole symmetry.</p>
|
||||
<p>We will later add a long-range interaction between the localised electrons. At that point we will replace the immobile fermions with a classical Ising field <span class="math inline">\(S_i = \tfrac{1}{2}(1 - 2d^\dagger_id_i) = \pm\tfrac{1}{2}\)</span> which I will refer to as the spins.</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
H_{\mathrm{FK}} = & \;U \sum_{i} S_i\;(c^\dagger_{i}c_{i} - \tfrac{1}{2}) -\;t \sum_{\langle i,j\rangle} c^\dagger_{i}c_{j}.\\
|
||||
H_{\mathrm{FK}} = & \;U \sum_{i} S_i\;(c^\dagger_{i}c^{\phantom{\dagger}}_{i} - \tfrac{1}{2}) -\;t \sum_{\langle i,j\rangle} c^\dagger_{i}c^{\phantom{\dagger}}_{j}.\\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>The FK model can be solved exactly with dynamic mean field theory in the infinite dimensional limit <span class="citation" data-cites="antipovCriticalExponentsStrongly2014 ribicNonlocalCorrelationsSpectral2016 freericksExactDynamicalMeanfield2003 herrmannNonequilibriumDynamicalCluster2016"> [<a href="#ref-antipovCriticalExponentsStrongly2014" role="doc-biblioref">9</a>–<a href="#ref-herrmannNonequilibriumDynamicalCluster2016" role="doc-biblioref">12</a>]</span>. In lower dimensional systems it has radically different behaviour, as we shall see.</p>
|
||||
</section>
|
||||
<section id="phase-diagrams" class="level2">
|
||||
<h2>Phase Diagrams</h2>
|
||||
<figure>
|
||||
<img src="/assets/thesis/background_chapter/fk_phase_diagram.svg" id="fig-fk_phase_diagram" data-short-caption="Falicov-Kimball Temperature-Interaction Phase Diagrams" style="width:100.0%" alt="Figure 2: Schematic Phase diagram of the Falicov-Kimball model in dimensions greater than two. At low temperature the classical fermions (spins) settle into an ordered charge density wave state (antiferromagnetic state). The schematic diagram for the Hubbard model is the same. Reproduced from [9,13]." />
|
||||
<figcaption aria-hidden="true">Figure 2: Schematic Phase diagram of the Falicov-Kimball model in dimensions greater than two. At low temperature the classical fermions (spins) settle into an ordered charge density wave state (antiferromagnetic state). The schematic diagram for the Hubbard model is the same. Reproduced from <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016 antipovCriticalExponentsStrongly2014"> [<a href="#ref-antipovCriticalExponentsStrongly2014" role="doc-biblioref">9</a>,<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">13</a>]</span>.</figcaption>
|
||||
<img src="/assets/thesis/background_chapter/fk_phase_diagram.svg" id="fig-fk_phase_diagram" data-short-caption="Falicov-Kimball Temperature-Interaction Phase Diagrams" style="width:100.0%" alt="Figure 2: Schematic Phase diagram of the Falicov-Kimball model in dimensions two or more At low temperature the classical fermions (spins) settle into an ordered charge density wave state (antiferromagnetic state). The schematic diagram for the Hubbard model is the same. Reproduced from [9,13]." />
|
||||
<figcaption aria-hidden="true">Figure 2: Schematic Phase diagram of the Falicov-Kimball model in dimensions two or more At low temperature the classical fermions (spins) settle into an ordered charge density wave state (antiferromagnetic state). The schematic diagram for the Hubbard model is the same. Reproduced from <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016 antipovCriticalExponentsStrongly2014"> [<a href="#ref-antipovCriticalExponentsStrongly2014" role="doc-biblioref">9</a>,<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">13</a>]</span>.</figcaption>
|
||||
</figure>
|
||||
<p>In dimensions greater than one, the FK model exhibits a phase transition at some <span class="math inline">\(U\)</span> dependent critical temperature <span class="math inline">\(T_c(U)\)</span> to a low temperature ordered phase <span class="citation" data-cites="maskaThermodynamicsTwodimensionalFalicovKimball2006"> [<a href="#ref-maskaThermodynamicsTwodimensionalFalicovKimball2006" role="doc-biblioref">14</a>]</span>. In terms of the heavy electrons this corresponds to them occupying only one of the two sublattices A and B, known as a Charge Density Wave (CDW) phase. In terms of spins, this is an antiferromagnetic phase.</p>
|
||||
<p>In the disordered region above <span class="math inline">\(T_c(U)\)</span>, there are two insulating phases. For weak interactions <span class="math inline">\(U << t\)</span>, thermal fluctuations in the spins act as an effective disorder potential for the fermions. This causes them to localise, giving rise to an Anderson insulating (AI) phase <span class="citation" data-cites="andersonAbsenceDiffusionCertain1958"> [<a href="#ref-andersonAbsenceDiffusionCertain1958" role="doc-biblioref">15</a>]</span> which we will discuss more in section <a href="../2_Background/2.4_Disorder.html#bg-disorder-and-localisation">2.3</a>. For strong interactions <span class="math inline">\(U >> t\)</span>, the spins are not ordered. Nevertheless, their interaction with the electrons opens a gap, leading to a Mott insulator analogous to that of the Hubbard model <span class="citation" data-cites="brandtThermodynamicsCorrelationFunctions1989"> [<a href="#ref-brandtThermodynamicsCorrelationFunctions1989" role="doc-biblioref">16</a>]</span>. The presence of an interaction driven phase like the Mott insulator in an exactly solvable model is part of what makes the FK model such an interesting system.</p>
|
||||
<p>By contrast, in the 1D FK model there is no Finite-Temperature Phase Transition (FTPT) to an ordered CDW phase <span class="citation" data-cites="liebAbsenceMottTransition1968"> [<a href="#ref-liebAbsenceMottTransition1968" role="doc-biblioref">17</a>]</span>. Indeed, dimensionality is crucial for the physics of both localisation and FTPTs. In 1D, disorder generally dominates: even the weakest disorder exponentially localises <em>all</em> single particle eigenstates. In the 1D FK model, this means that the whole spectrum is localised at all finite temperatures <span class="citation" data-cites="goldshteinPurePointSpectrum1977 abrahamsScalingTheoryLocalization1979 kramerLocalizationTheoryExperiment1993"> [<a href="#ref-goldshteinPurePointSpectrum1977" role="doc-biblioref">18</a>–<a href="#ref-kramerLocalizationTheoryExperiment1993" role="doc-biblioref">20</a>]</span>. Although at low temperatures, the localisation length may be so large that the states appear extended in finite sized systems <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016"> [<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">13</a>]</span>. Only longer-range correlations of the disorder potential can potentially induce localisation-delocalisation transitions in 1D <span class="citation" data-cites="aubryAnalyticityBreakingAnderson1980 dassarmaLocalizationMobilityEdges1990 dunlapAbsenceLocalizationRandomdimer1990"> [<a href="#ref-aubryAnalyticityBreakingAnderson1980" role="doc-biblioref">21</a>–<a href="#ref-dunlapAbsenceLocalizationRandomdimer1990" role="doc-biblioref">23</a>]</span></p>
|
||||
<p>By contrast, in the 1D FK model there is no Finite-Temperature Phase Transition (FTPT) to an ordered CDW phase <span class="citation" data-cites="liebAbsenceMottTransition1968"> [<a href="#ref-liebAbsenceMottTransition1968" role="doc-biblioref">17</a>]</span>. Indeed, dimensionality is crucial for the physics of both localisation and FTPTs. In 1D, disorder generally dominates: even the weakest disorder exponentially localises <em>all</em> single particle eigenstates. In the 1D FK model, this means that the whole spectrum is localised at all finite temperatures <span class="citation" data-cites="goldshteinPurePointSpectrum1977 abrahamsScalingTheoryLocalization1979 kramerLocalizationTheoryExperiment1993"> [<a href="#ref-goldshteinPurePointSpectrum1977" role="doc-biblioref">18</a>–<a href="#ref-kramerLocalizationTheoryExperiment1993" role="doc-biblioref">20</a>]</span>. Although at low temperatures, the localisation length may be so large that the states appear extended in finite sized systems <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016"> [<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">13</a>]</span>. Only longer-range correlations of the disorder potential can potentially induce localisation-delocalisation transitions in 1D <span class="citation" data-cites="aubryAnalyticityBreakingAnderson1980 dassarmaLocalizationMobilityEdges1990 dunlapAbsenceLocalizationRandomdimer1990"> [<a href="#ref-aubryAnalyticityBreakingAnderson1980" role="doc-biblioref">21</a>–<a href="#ref-dunlapAbsenceLocalizationRandomdimer1990" role="doc-biblioref">23</a>]</span>.</p>
|
||||
<p>The absence of finite temperature ordered phases in 1D systems is a general feature. It can be understood as a consequence of the fact that domain walls are energetically cheap in 1D. Thermodynamically, short-range interactions just cannot overcome the entropy of thermal defects in 1D. However, the addition of longer range interactions can overcome this <span class="citation" data-cites="peierlsIsingModelFerromagnetism1936 kennedyItinerantElectronModel1986"> [<a href="#ref-peierlsIsingModelFerromagnetism1936" role="doc-biblioref">24</a>,<a href="#ref-kennedyItinerantElectronModel1986" role="doc-biblioref">25</a>]</span>.</p>
|
||||
<p>The absence of an FTPT in the short ranged FK chain is far from obvious because the Ruderman-Kittel-Kasuya-Yosida (RKKY) interaction mediated by the fermions <span class="citation" data-cites="kasuyaTheoryMetallicFerro1956 rudermanIndirectExchangeCoupling1954 vanvleckNoteInteractionsSpins1962 yosidaMagneticPropertiesCuMn1957"> [<a href="#ref-kasuyaTheoryMetallicFerro1956" role="doc-biblioref">26</a>–<a href="#ref-yosidaMagneticPropertiesCuMn1957" role="doc-biblioref">29</a>]</span> decays as <span class="math inline">\(r^{-1}\)</span> in 1D <span class="citation" data-cites="rusinCalculationRKKYRange2017"> [<a href="#ref-rusinCalculationRKKYRange2017" role="doc-biblioref">30</a>]</span>. This could, in principle, induce the necessary long-range interactions for the classical Ising background to order at low temperatures <span class="citation" data-cites="thoulessLongRangeOrderOneDimensional1969 peierlsIsingModelFerromagnetism1936"> [<a href="#ref-peierlsIsingModelFerromagnetism1936" role="doc-biblioref">24</a>,<a href="#ref-thoulessLongRangeOrderOneDimensional1969" role="doc-biblioref">31</a>]</span>. However, Kennedy and Lieb established rigorously that, at half-filling, a CDW phase only exists at <span class="math inline">\(T = 0\)</span> for the 1D FK model <span class="citation" data-cites="kennedyItinerantElectronModel1986"> [<a href="#ref-kennedyItinerantElectronModel1986" role="doc-biblioref">25</a>]</span>.</p>
|
||||
<p>The 1D FK model has been studied numerically, perturbatively in interaction strength <span class="math inline">\(U\)</span> and in the continuum limit <span class="citation" data-cites="bursillOneDimensionalContinuum1994"> [<a href="#ref-bursillOneDimensionalContinuum1994" role="doc-biblioref">32</a>]</span>. The main results are that for attractive <span class="math inline">\(U > U_c\)</span> the system forms electron spin bound state ‘atoms’ which repel one another <span class="citation" data-cites="gruberGroundStateEnergyLowTemperature1993"> [<a href="#ref-gruberGroundStateEnergyLowTemperature1993" role="doc-biblioref">33</a>]</span> and that the ground state phase diagram has a fractal structure as a function of electron filling, a devil’s staircase <span class="citation" data-cites="freericksTwostateOnedimensionalSpinless1990 michelettiCompleteDevilStaircase1997"> [<a href="#ref-freericksTwostateOnedimensionalSpinless1990" role="doc-biblioref">34</a>,<a href="#ref-michelettiCompleteDevilStaircase1997" role="doc-biblioref">35</a>]</span>.</p>
|
||||
@ -117,7 +117,7 @@ H_{\mathrm{FK}} = & \;U \sum_{i} S_i\;(c^\dagger_{i}c_{i} - \tfrac{1}{2}) -\
|
||||
<section id="long-ranged-ising-model" class="level2">
|
||||
<h2>Long-Ranged Ising model</h2>
|
||||
<p>The suppression of phase transitions is a common phenomenon in 1D systems and the Ising model serves as the canonical illustration of this. In terms of classical spins <span class="math inline">\(S_i = \pm 1\)</span> the standard Ising model reads</p>
|
||||
<p><span class="math display">\[H_{\mathrm{I}} = \sum_{\langle ij \rangle} S_i S_j\]</span></p>
|
||||
<p><span class="math display">\[H_{\mathrm{I}} = \sum_{\langle ij \rangle} S_i S_j.\]</span></p>
|
||||
<p>Like the FK model, the Ising model shows an FTPT to an ordered state only in 2D and above. This can be understood via Peierls’ argument <span class="citation" data-cites="peierlsIsingModelFerromagnetism1936 kennedyItinerantElectronModel1986"> [<a href="#ref-peierlsIsingModelFerromagnetism1936" role="doc-biblioref">24</a>,<a href="#ref-kennedyItinerantElectronModel1986" role="doc-biblioref">25</a>]</span> to be a consequence of the low energy penalty for domain walls in 1D systems.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/intro_chapter/ising_model_domain_wall.svg" id="fig-ising_model_domain_wall" data-short-caption="Domain walls in the long-range Ising Model" style="width:100.0%" alt="Figure 3: Domain walls in the 1D Ising model cost finite energy because they affect only one interaction. In the Long-Range Ising (LRI) model it depends on how the interactions decay with distance." />
|
||||
@ -125,16 +125,16 @@ H_{\mathrm{FK}} = & \;U \sum_{i} S_i\;(c^\dagger_{i}c_{i} - \tfrac{1}{2}) -\
|
||||
</figure>
|
||||
<p>Following Peierls’ argument, consider the difference in free energy <span class="math inline">\(\Delta F = \Delta E - T\Delta S\)</span> between an ordered state and a state with single domain wall as in fig. <a href="#fig:ising_model_domain_wall">3</a>. If this value is negative, it implies that the ordered state is unstable with respect to domain wall defects, and they will thus proliferate, destroying the ordered phase. If we consider the scaling of the two terms with system size <span class="math inline">\(L\)</span>, we see that short range interactions produce a constant energy penalty <span class="math inline">\(\Delta E\)</span> for a domain wall. In contrast, the number of such single domain wall states scales linearly with system size so the entropy is <span class="math inline">\(\propto \ln L\)</span>. Thus, the entropic contribution dominates (eventually) in the thermodynamic limit and no finite temperature order is possible. In 2D and above, the energy penalty of a large domain wall scales like <span class="math inline">\(L^{d-1}\)</span> which is why they can support ordered phases. This argument does not quite apply to the FK model because of the aforementioned RKKY interaction. Instead, this argument will give us insight into how to recover an ordered phase in the 1D FK model.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/background_chapter/alpha_diagram.svg" id="fig-alpha_diagram" data-short-caption="Long-Range Ising Model Behaviour" style="width:100.0%" alt="Figure 4: The thermodynamic behaviour of the long-range Ising model H_{\mathrm{LRI}} = J \sum_{i\neq j} |i - j|^{-\alpha} S_i S_j as the exponent of the interaction \alpha is varied. In my simulations I stick to a value of \alpha = \tfrac{5}{4} the complexity of non-universal critical exponents." />
|
||||
<figcaption aria-hidden="true">Figure 4: The thermodynamic behaviour of the long-range Ising model <span class="math inline">\(H_{\mathrm{LRI}} = J \sum_{i\neq j} |i - j|^{-\alpha} S_i S_j\)</span> as the exponent of the interaction <span class="math inline">\(\alpha\)</span> is varied. In my simulations I stick to a value of <span class="math inline">\(\alpha = \tfrac{5}{4}\)</span> the complexity of non-universal critical exponents.</figcaption>
|
||||
<img src="/assets/thesis/background_chapter/alpha_diagram.svg" id="fig-alpha_diagram" data-short-caption="Long-Range Ising Model Behaviour" style="width:100.0%" alt="Figure 4: The thermodynamic behaviour of the long-range Ising model H_{\mathrm{LRI}} = J \sum_{i\neq j} |i - j|^{-\alpha} S_i S_j as the exponent of the interaction \alpha is varied. In my simulations I stick to a value of \alpha = \tfrac{5}{4} to avoid the complexity of non-universal critical exponents that arise above \alpha = \tfrac{3}{2}" />
|
||||
<figcaption aria-hidden="true">Figure 4: The thermodynamic behaviour of the long-range Ising model <span class="math inline">\(H_{\mathrm{LRI}} = J \sum_{i\neq j} |i - j|^{-\alpha} S_i S_j\)</span> as the exponent of the interaction <span class="math inline">\(\alpha\)</span> is varied. In my simulations I stick to a value of <span class="math inline">\(\alpha = \tfrac{5}{4}\)</span> to avoid the complexity of non-universal critical exponents that arise above <span class="math inline">\(\alpha = \tfrac{3}{2}\)</span></figcaption>
|
||||
</figure>
|
||||
<p>In contrast, the LRI model <span class="math inline">\(H_{\mathrm{LRI}}\)</span> can have an FTPT in 1D.</p>
|
||||
<p><span class="math display">\[H_{\mathrm{LRI}} = \sum_{ij} J(|i-j|) S_i S_j = J \sum_{i\neq j} |i - j|^{-\alpha} S_i S_j\]</span></p>
|
||||
<p>Renormalisation group analyses show that the LRI model has an ordered phase in 1D for <span class="math inline">\(1 < \alpha < 2\)</span> <span class="citation" data-cites="dysonExistencePhasetransitionOnedimensional1969"> [<a href="#ref-dysonExistencePhasetransitionOnedimensional1969" role="doc-biblioref">36</a>]</span>. Peierls’ argument can be extended <span class="citation" data-cites="thoulessLongRangeOrderOneDimensional1969"> [<a href="#ref-thoulessLongRangeOrderOneDimensional1969" role="doc-biblioref">31</a>]</span> to long-range interactions to provide intuition for why this is the case. Let’s consider again the energy difference between the ordered state <span class="math inline">\(|\ldots\uparrow\uparrow\uparrow\uparrow\ldots\rangle\)</span> and a domain wall state <span class="math inline">\(|\ldots\uparrow\uparrow\downarrow\downarrow\ldots\rangle\)</span>. In the case of the LRI model, careful counting shows that this energy penalty is <span id="eq:bg-dw-penalty"><span class="math display">\[\Delta E \propto \sum_{n=1}^{\infty} n J(n)\qquad{(1)}\]</span></span></p>
|
||||
<p><span class="math display">\[H_{\mathrm{LRI}} = \sum_{ij} J(|i-j|) S_i S_j = J \sum_{i\neq j} |i - j|^{-\alpha} S_i S_j.\]</span></p>
|
||||
<p>Renormalisation group analyses show that the LRI model has an ordered phase in 1D for <span class="math inline">\(1 < \alpha < 2\)</span> <span class="citation" data-cites="dysonExistencePhasetransitionOnedimensional1969"> [<a href="#ref-dysonExistencePhasetransitionOnedimensional1969" role="doc-biblioref">36</a>]</span>. Peierls’ argument can be extended <span class="citation" data-cites="thoulessLongRangeOrderOneDimensional1969"> [<a href="#ref-thoulessLongRangeOrderOneDimensional1969" role="doc-biblioref">31</a>]</span> to long-range interactions to provide intuition for why this is the case. Let’s consider again the energy difference between the ordered state <span class="math inline">\(|\ldots\uparrow\uparrow\uparrow\uparrow\ldots\rangle\)</span> and a domain wall state <span class="math inline">\(|\ldots\uparrow\uparrow\downarrow\downarrow\ldots\rangle\)</span>. In the case of the LRI model, careful counting shows that this energy penalty is <span id="eq:bg-dw-penalty"><span class="math display">\[\Delta E \propto \sum_{n=1}^{\infty} n J(n),\qquad{(1)}\]</span></span></p>
|
||||
<p>because each interaction between spins separated across the domain by a bond length <span class="math inline">\(n\)</span> can be drawn between <span class="math inline">\(n\)</span> equivalent pairs of sites. The behaviour then depends crucially on how eq. <a href="#eq:bg-dw-penalty">1</a> scales with system size. Ruelle proved rigorously for a very general class of 1D systems that if <span class="math inline">\(\Delta E\)</span> or its many-body generalisation converges to a constant in the thermodynamic limit then the free energy is analytic <span class="citation" data-cites="ruelleStatisticalMechanicsOnedimensional1968"> [<a href="#ref-ruelleStatisticalMechanicsOnedimensional1968" role="doc-biblioref">37</a>]</span>. This rules out a finite order phase transition, though not one of the Kosterlitz-Thouless type. Dyson also proved this, though with a slightly different condition on <span class="math inline">\(J(n)\)</span> <span class="citation" data-cites="dysonExistencePhasetransitionOnedimensional1969"> [<a href="#ref-dysonExistencePhasetransitionOnedimensional1969" role="doc-biblioref">36</a>]</span>.</p>
|
||||
<p>With a power law form for <span class="math inline">\(J(n)\)</span>, there are a few cases to consider: For <span class="math inline">\(\alpha = 0\)</span> i.e infinite range interactions, the Ising model is exactly solvable and mean field theory is exact <span class="citation" data-cites="lipkinValidityManybodyApproximation1965"> [<a href="#ref-lipkinValidityManybodyApproximation1965" role="doc-biblioref">38</a>]</span>. This limit is the same as the infinite dimensional limit. For <span class="math inline">\(\alpha \leq 1\)</span> we have very slowly decaying interactions. <span class="math inline">\(\Delta E\)</span> does not converge as a function of system size so the Hamiltonian is non-extensive, a topic not without some considerable controversy <span class="citation" data-cites="grossNonextensiveHamiltonianSystems2002 lutskoQuestioningValidityNonextensive2011 wangCommentNonextensiveHamiltonian2003"> [<a href="#ref-grossNonextensiveHamiltonianSystems2002" role="doc-biblioref">39</a>–<a href="#ref-wangCommentNonextensiveHamiltonian2003" role="doc-biblioref">41</a>]</span> that I will not consider further here. For <span class="math inline">\(1 < \alpha < 2\)</span>, we get a phase transition to an ordered state at a finite temperature. This is what we want! At the special point <span class="math inline">\(\alpha = 2\)</span>, the energy of domain walls diverges logarithmically. This turns out to be a Kostelitz-Thouless transition <span class="citation" data-cites="thoulessLongRangeOrderOneDimensional1969"> [<a href="#ref-thoulessLongRangeOrderOneDimensional1969" role="doc-biblioref">31</a>]</span>. Finally, for <span class="math inline">\(\alpha > 2\)</span> we have very quickly decaying interactions and domain walls again have a finite energy penalty. Hence, Peirels’ argument holds and there is no phase transition.</p>
|
||||
<p>With a power law form for <span class="math inline">\(J(n)\)</span>, there are a few cases to consider: For <span class="math inline">\(\alpha = 0\)</span> i.e., infinite range interactions, the Ising model is exactly solvable and mean field theory is exact <span class="citation" data-cites="lipkinValidityManybodyApproximation1965"> [<a href="#ref-lipkinValidityManybodyApproximation1965" role="doc-biblioref">38</a>]</span>. This limit is the same as the infinite dimensional limit. For <span class="math inline">\(\alpha \leq 1\)</span> we have very slowly decaying interactions. <span class="math inline">\(\Delta E\)</span> does not converge as a function of system size so the Hamiltonian is non-extensive, a topic not without some considerable controversy <span class="citation" data-cites="grossNonextensiveHamiltonianSystems2002 lutskoQuestioningValidityNonextensive2011 wangCommentNonextensiveHamiltonian2003"> [<a href="#ref-grossNonextensiveHamiltonianSystems2002" role="doc-biblioref">39</a>–<a href="#ref-wangCommentNonextensiveHamiltonian2003" role="doc-biblioref">41</a>]</span> that I will not consider further here. For <span class="math inline">\(1 < \alpha < 2\)</span>, we get a phase transition to an ordered state at a finite temperature. This is what we want! At the special point <span class="math inline">\(\alpha = 2\)</span>, the energy of domain walls diverges logarithmically. This turns out to be a Kostelitz-Thouless transition <span class="citation" data-cites="thoulessLongRangeOrderOneDimensional1969"> [<a href="#ref-thoulessLongRangeOrderOneDimensional1969" role="doc-biblioref">31</a>]</span>. Finally, for <span class="math inline">\(\alpha > 2\)</span> we have very quickly decaying interactions and domain walls again have a finite energy penalty. Hence, Peirels’ argument holds and there is no phase transition.</p>
|
||||
<p>One final complexity is that for <span class="math inline">\(\tfrac{3}{2} < \alpha < 2\)</span> renormalisation group methods show that the critical point has non-universal critical exponents that depend on <span class="math inline">\(\alpha\)</span> <span class="citation" data-cites="fisherCriticalExponentsLongRange1972"> [<a href="#ref-fisherCriticalExponentsLongRange1972" role="doc-biblioref">42</a>]</span>. To avoid this potential confounding factor we will park ourselves at <span class="math inline">\(\alpha = 1.25\)</span> when we apply these ideas to the FK model.</p>
|
||||
<p>Were we to extend this to arbitrary dimension <span class="math inline">\(d\)</span>, we would find that thermodynamics properties, generally both <span class="math inline">\(d\)</span> and <span class="math inline">\(\alpha\)</span>, long-range interactions can modify the ‘effective dimension’ of thermodynamic systems <span class="citation" data-cites="angeliniRelationsShortrangeLongrange2014"> [<a href="#ref-angeliniRelationsShortrangeLongrange2014" role="doc-biblioref">43</a>]</span>.</p>
|
||||
<p>Were we to extend this to arbitrary dimension <span class="math inline">\(d\)</span>, we would find that, in general, both <span class="math inline">\(d\)</span> and <span class="math inline">\(\alpha\)</span> affect the thermodynamic properties of the model. Long-range interactions essentially modify the ‘effective dimension’ of thermodynamic system <span class="citation" data-cites="angeliniRelationsShortrangeLongrange2014"> [<a href="#ref-angeliniRelationsShortrangeLongrange2014" role="doc-biblioref">43</a>]</span>.</p>
|
||||
<p>Next Section: <a href="../2_Background/2.2_HKM_Model.html">The Kitaev Honeycomb Model</a></p>
|
||||
</section>
|
||||
</section>
|
||||
|
@ -84,15 +84,15 @@ image:
|
||||
<section id="bg-hkm-model" class="level1">
|
||||
<h1>The Kitaev Honeycomb Model</h1>
|
||||
<figure>
|
||||
<img src="/assets/thesis/amk_chapter/intro/honeycomb_zoom/intro_figure_by_hand.svg" id="fig-intro_figure_by_hand" data-short-caption="The Kitaev Honeycomb Model" style="width:100.0%" alt="Figure 1: (a) The Kitaev honeycomb model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e. the lattice is trivalent. One of three labels is assigned to each bond \{x,\;y,\;z\} here represented by colour. (b) After transforming to the Majorana representation we get an emergent gauge degree of freedom u_{jk} = \pm 1 that lives on each bond, the bond variables. These are antisymmetric, u_{jk} = -u_{kj}, so we represent them graphically with arrows on each bond that point in the direction that u_{jk} = +1. (c) The Majorana transformation can be visualised as breaking each spin into four Majoranas b_i^x,\;b_i^y,\;b_i^z,\;c_i. The x, y and z Majoranas then pair along the bonds forming conserved \mathbb{Z}_2 bond operators u_{jk} = \langle i b_i^\alpha b_j^\alpha \rangle. The remaining c_i operators form an effective quadratic Hamiltonian H = \frac{i}{4} \sum_{\langle i,j\rangle_\alpha} 2J^{\alpha} u_{ij} \hat{c}_i \hat{c}_j." />
|
||||
<figcaption aria-hidden="true">Figure 1: <strong>(a)</strong> The Kitaev honeycomb model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e. the lattice is trivalent. One of three labels is assigned to each bond <span class="math inline">\(\{x,\;y,\;z\}\)</span> here represented by colour. <strong>(b)</strong> After transforming to the Majorana representation we get an emergent gauge degree of freedom <span class="math inline">\(u_{jk} = \pm 1\)</span> that lives on each bond, the bond variables. These are antisymmetric, <span class="math inline">\(u_{jk} = -u_{kj}\)</span>, so we represent them graphically with arrows on each bond that point in the direction that <span class="math inline">\(u_{jk} = +1\)</span>. <strong>(c)</strong> The Majorana transformation can be visualised as breaking each spin into four Majoranas <span class="math inline">\(b_i^x,\;b_i^y,\;b_i^z,\;c_i\)</span>. The x, y and z Majoranas then pair along the bonds forming conserved <span class="math inline">\(\mathbb{Z}_2\)</span> bond operators <span class="math inline">\(u_{jk} = \langle i b_i^\alpha b_j^\alpha \rangle\)</span>. The remaining <span class="math inline">\(c_i\)</span> operators form an effective quadratic Hamiltonian <span class="math inline">\(H = \frac{i}{4} \sum_{\langle i,j\rangle_\alpha} 2J^{\alpha} u_{ij} \hat{c}_i \hat{c}_j\)</span>.</figcaption>
|
||||
<img src="/assets/thesis/amk_chapter/intro/honeycomb_zoom/intro_figure_by_hand.svg" id="fig-intro_figure_by_hand" data-short-caption="The Kitaev Honeycomb Model" style="width:100.0%" alt="Figure 1: (a) The Kitaev honeycomb model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e., the lattice is trivalent. One of three labels is assigned to each bond \{x,\;y,\;z\} here represented by colour. (b) After transforming to the Majorana representation we get an emergent gauge degree of freedom u_{jk} = \pm 1 that lives on each bond, the bond variables. These are antisymmetric, u_{jk} = -u_{kj}, so we represent them graphically with arrows on each bond that point in the direction that u_{jk} = +1. (c) The Majorana transformation, discussed later in the main text, can be visualised as breaking each spin into four Majoranas b_i^x,\;b_i^y,\;b_i^z,\;c_i. The b_i^x,\;b_i^y and b_i^z Majoranas then pair along the bonds forming conserved \mathbb{Z}_2 bond operators u_{jk} = \langle i b_i^\alpha b_j^\alpha \rangle. The remaining c_i operators form an effective quadratic Hamiltonian H = \frac{i}{4} \sum_{\langle i,j\rangle_\alpha} 2J^{\alpha} u_{ij} \hat{c}_i \hat{c}_j." />
|
||||
<figcaption aria-hidden="true">Figure 1: <strong>(a)</strong> The Kitaev honeycomb model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e., the lattice is trivalent. One of three labels is assigned to each bond <span class="math inline">\(\{x,\;y,\;z\}\)</span> here represented by colour. <strong>(b)</strong> After transforming to the Majorana representation we get an emergent gauge degree of freedom <span class="math inline">\(u_{jk} = \pm 1\)</span> that lives on each bond, the bond variables. These are antisymmetric, <span class="math inline">\(u_{jk} = -u_{kj}\)</span>, so we represent them graphically with arrows on each bond that point in the direction that <span class="math inline">\(u_{jk} = +1\)</span>. <strong>(c)</strong> The Majorana transformation, discussed later in the main text, can be visualised as breaking each spin into four Majoranas <span class="math inline">\(b_i^x,\;b_i^y,\;b_i^z,\;c_i\)</span>. The <span class="math inline">\(b_i^x,\;b_i^y\)</span> and <span class="math inline">\(b_i^z\)</span> Majoranas then pair along the bonds forming conserved <span class="math inline">\(\mathbb{Z}_2\)</span> bond operators <span class="math inline">\(u_{jk} = \langle i b_i^\alpha b_j^\alpha \rangle\)</span>. The remaining <span class="math inline">\(c_i\)</span> operators form an effective quadratic Hamiltonian <span class="math inline">\(H = \frac{i}{4} \sum_{\langle i,j\rangle_\alpha} 2J^{\alpha} u_{ij} \hat{c}_i \hat{c}_j\)</span>.</figcaption>
|
||||
</figure>
|
||||
<section id="the-spin-hamiltonian" class="level2">
|
||||
<h2>The Spin Hamiltonian</h2>
|
||||
<p>The Kitaev Honeycomb (KH) model is an exactly solvable model of interacting spin<span class="math inline">\(-1/2\)</span> spins on the vertices of a honeycomb lattice. Each bond in the lattice is assigned a label <span class="math inline">\(\alpha \in \{ x, y, z\}\)</span> and couples two spins along the <span class="math inline">\(\alpha\)</span> axis. See fig. <a href="#fig:intro_figure_by_hand">1</a> for a diagram of the setup.</p>
|
||||
<p>This gives us the Hamiltonian <span id="eq:bg-kh-model"><span class="math display">\[ H = - \sum_{\langle j,k\rangle_\alpha} J^{\alpha}\sigma_j^{\alpha}\sigma_k^{\alpha}, \qquad{(1)}\]</span></span> where <span class="math inline">\(\sigma^\alpha_j\)</span> is the <span class="math inline">\(\alpha\)</span> component of a Pauli matrix acting on site <span class="math inline">\(j\)</span> and <span class="math inline">\(\langle j,k\rangle_\alpha\)</span> is a pair of nearest-neighbour indices connected by an <span class="math inline">\(\alpha\)</span>-bond with exchange coupling <span class="math inline">\(J^\alpha\)</span>. Kitaev introduced this model in his seminal 2006 paper <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>.</p>
|
||||
<p>The KH can arise as the result of strong spin-orbit couplings in, for example, the transition metal based compounds <span class="citation" data-cites="Jackeli2009 HerrmannsAnRev2018 Winter2017 TrebstPhysRep2022 Takagi2019"> [<a href="#ref-Jackeli2009" role="doc-biblioref">2</a>–<a href="#ref-Takagi2019" role="doc-biblioref">6</a>]</span>. The model is highly frustrated: each spin would like to align along a different direction with each of its three neighbours but this cannot be achieved even classically <span class="citation" data-cites="chandraClassicalHeisenbergSpins2010 selaOrderbydisorderSpinorbitalLiquids2014"> [<a href="#ref-chandraClassicalHeisenbergSpins2010" role="doc-biblioref">7</a>,<a href="#ref-selaOrderbydisorderSpinorbitalLiquids2014" role="doc-biblioref">8</a>]</span>. This frustration leads the model to have a Quantum Sping Liquid (QSL) ground state, a complex many-body state with a high degree of entanglement but no long-range magnetic order even at zero temperature. While the possibility of a QSL ground state was suggested much earlier <span class="citation" data-cites="Anderson1973"> [<a href="#ref-Anderson1973" role="doc-biblioref">9</a>]</span>, the KH model was the first exactly solvable models of the QSL state. The KH model has a rich ground state phase diagram with gapless and gapped phases, the latter supporting fractionalised quasiparticles with both Abelian and non-Abelian quasiparticle excitations. Anyons have been the subject of much attention because, among other reasons, they can be braided through spacetime to achieve noise tolerant quantum computations <span class="citation" data-cites="freedmanTopologicalQuantumComputation2003"> [<a href="#ref-freedmanTopologicalQuantumComputation2003" role="doc-biblioref">10</a>]</span>. At finite temperature the KH model undergoes a phase transition to a thermal metal state <span class="citation" data-cites="selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">11</a>]</span>. The KH model can be solved exactly via a mapping to Majorana fermions. This mapping yields an extensive number of static <span class="math inline">\(\mathbb Z_2\)</span> fluxes tied to an emergent gauge field with the remaining fermions are governed by a free fermion hamiltonian.</p>
|
||||
<p>This section will go over the standard model in detail, first discussing <a href="../2_Background/2.2_HKM_Model.html#the-spin-model">the spin model</a>, then detailing the transformation to a <a href="../2_Background/2.2_HKM_Model.html#the-majorana-model">Majorana hamiltonian</a> that allows a full solution while enlarging the Hamiltonian. I will discuss the properties of the <a href="../2_Background/2.2_HKM_Model.html#an-emergent-gauge-field">emergent gauge fields</a> and the projector. The <a href="../2_Background/2.2_HKM_Model.html#sec:anyons">next section</a> will discuss anyons, topology and the Chern number, using the KH model as an explicit example.I will then discuss the ground state found via Lieb’s theorem as well as work on generalisations of the ground state to other lattices. Finally I will present the <a href="../2_Background/2.2_HKM_Model.html#ground-state-phases">phase diagram</a>.</p>
|
||||
<p>The KH can arise as the result of strong spin-orbit couplings in, for example, the transition metal based compounds <span class="citation" data-cites="Jackeli2009 HerrmannsAnRev2018 Winter2017 TrebstPhysRep2022 Takagi2019"> [<a href="#ref-Jackeli2009" role="doc-biblioref">2</a>–<a href="#ref-Takagi2019" role="doc-biblioref">6</a>]</span>. The model is highly frustrated: each spin would like to align along a different direction with each of its three neighbours but this cannot be achieved even classically <span class="citation" data-cites="chandraClassicalHeisenbergSpins2010 selaOrderbydisorderSpinorbitalLiquids2014"> [<a href="#ref-chandraClassicalHeisenbergSpins2010" role="doc-biblioref">7</a>,<a href="#ref-selaOrderbydisorderSpinorbitalLiquids2014" role="doc-biblioref">8</a>]</span>. This frustration leads the model to have a Quantum Spin Liquid (QSL) ground state, a complex many-body state with a high degree of entanglement but no long-range magnetic order even at zero temperature. While the possibility of a QSL ground state was suggested much earlier <span class="citation" data-cites="Anderson1973"> [<a href="#ref-Anderson1973" role="doc-biblioref">9</a>]</span>, the KH model was the first exactly solvable models of the QSL state. The KH model has a rich ground state phase diagram with gapless and gapped phases, the latter supporting fractionalised quasiparticles with both Abelian and non-Abelian quasiparticle excitations. Anyons have been the subject of much attention because, among other reasons, they can be braided through spacetime to achieve noise tolerant quantum computations <span class="citation" data-cites="freedmanTopologicalQuantumComputation2003"> [<a href="#ref-freedmanTopologicalQuantumComputation2003" role="doc-biblioref">10</a>]</span>. At finite temperature the KH model undergoes a phase transition to a thermal metal state <span class="citation" data-cites="selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">11</a>]</span>. The KH model can be solved exactly via a mapping to Majorana fermions. This mapping yields an extensive number of static <span class="math inline">\(\mathbb Z_2\)</span> fluxes tied to an emergent gauge field with the remaining fermions are governed by a free fermion hamiltonian.</p>
|
||||
<p>This section will go over the standard model in detail, first discussing <a href="../2_Background/2.2_HKM_Model.html#the-spin-model">the spin model</a>, then detailing the transformation to a <a href="../2_Background/2.2_HKM_Model.html#the-majorana-model">Majorana hamiltonian</a> that allows a full solution while enlarging the Hamiltonian. I will discuss the properties of the <a href="../2_Background/2.2_HKM_Model.html#an-emergent-gauge-field">emergent gauge fields</a> and the projector. The <a href="../2_Background/2.2_HKM_Model.html#sec:anyons">next section</a> will discuss anyons, topology and the Chern number, using the KH model as an explicit example. I will then discuss the ground state found via Lieb’s theorem as well as work on generalisations of the ground state to other lattices. Finally, I will present the <a href="../2_Background/2.2_HKM_Model.html#ground-state-phases">phase diagram</a>.</p>
|
||||
</section>
|
||||
<section id="the-spin-model" class="level2">
|
||||
<h2>The Spin Model</h2>
|
||||
@ -103,7 +103,7 @@ image:
|
||||
<p>As discussed in the introduction, spin hamiltonians like that of the KH model arise in electronic systems as the result the balance of multiple effects <span class="citation" data-cites="TrebstPhysRep2022"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">5</a>]</span>. For instance, in certain transition metal systems with <span class="math inline">\(d^5\)</span> valence electrons, crystal field and spin-orbit couplings conspire to shift and split the <span class="math inline">\(d\)</span> orbitals into moments with spin <span class="math inline">\(j = 1/2\)</span> and <span class="math inline">\(j = 3/2\)</span>. Of these, the bandwidth <span class="math inline">\(t\)</span> of the <span class="math inline">\(j= 1/2\)</span> band is small, meaning that even relatively meagre electron correlations (such as those induced by the <span class="math inline">\(U\)</span> term in the Hubbard model) can lead to the opening of a Mott gap. From there we have a <span class="math inline">\(j = 1/2\)</span> Mott insulator whose effective spin-spin interactions are again shaped by the lattice geometry and spin-orbit coupling leading some materials to have strong bond-directional Ising-type interactions <span class="citation" data-cites="jackeliMottInsulatorsStrong2009 khaliullinOrbitalOrderFluctuations2005"> [<a href="#ref-jackeliMottInsulatorsStrong2009" role="doc-biblioref">12</a>,<a href="#ref-khaliullinOrbitalOrderFluctuations2005" role="doc-biblioref">13</a>]</span>. In the KH model the bond directionality refers to the fact that the coupling axis <span class="math inline">\(\alpha\)</span> in terms like <span class="math inline">\(\sigma_j^{\alpha}\sigma_k^{\alpha}\)</span> is strongly bond dependent.</p>
|
||||
<p>In the spin hamiltonian eq. <a href="#eq:bg-kh-model">1</a>, we can already tease out a set of conserved fluxes that will be key to the model’s solution. These fluxes are the expectations of Wilson loop operators</p>
|
||||
<p><span class="math display">\[\hat{W}_p = \prod_{\langle i,j\rangle_\alpha\; \in\; p} \sigma_i^{\alpha}\sigma_j^{\alpha},\]</span></p>
|
||||
<p>the products of bonds winding around a closed path <span class="math inline">\(p\)</span> on the lattice. These operators commute with the Hamiltonian and so have no time dynamics. The winding direction does not matter so long as it is fixed. By convention we will always use clockwise. Each closed path on the lattice is associated with a flux. The number of conserved quantities grows linearly with system size and is thus extensive. This is a common property for exactly solvable systems and can be compared to the heavy electrons present in the Falicov-Kimball model. The square of two loop operators is one so any contractible loop can be expressed as a product of loops around plaquettes of the lattice, as in fig. <a href="#fig:stokes_theorem">3</a>. For the honeycomb lattice, the plaquettes are the hexagons. The expectations of <span class="math inline">\(\hat{W}_p\)</span> through each plaquette, the fluxes, are therefore enough to describe the whole flux sector. We will focus on these fluxes, denoting them by <span class="math inline">\(\phi_i\)</span>. Once we have made the mapping to the Majorana Hamiltonian, I will explain how these fluxes can be connected to an emergent <span class="math inline">\(B\)</span> field which makes their interpretation as fluxes clear.</p>
|
||||
<p>the products of bonds winding around a closed path <span class="math inline">\(p\)</span> on the lattice. These operators commute with the Hamiltonian and so have no time dynamics. The winding direction does not matter so long as it is fixed. By convention, we will always use clockwise. Each closed path on the lattice is associated with a flux. The number of conserved quantities grows linearly with system size and is thus extensive. This is a common property for exactly solvable systems and can be compared to the heavy electrons present in the Falicov-Kimball model. The square of two loop operators is one so any contractible loop can be expressed as a product of loops around plaquettes of the lattice, as in fig. <a href="#fig:stokes_theorem">3</a>. For the honeycomb lattice, the plaquettes are the hexagons. The expectations of <span class="math inline">\(\hat{W}_p\)</span> through each plaquette, the fluxes, are therefore enough to describe the whole flux sector. We will focus on these fluxes, denoting them by <span class="math inline">\(\phi_i\)</span>. Once we have made the mapping to the Majorana Hamiltonian, I will explain how these fluxes can be connected to an emergent <span class="math inline">\(B\)</span> field which makes their interpretation as fluxes clear.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/amk_chapter/stokes_theorem/stokes_theorem.svg" id="fig-stokes_theorem" data-short-caption="We can construct arbitrary loops from plaquette fluxes." style="width:71.0%" alt="Figure 3: In the KH model, Wilson loop operators \hat{W}_p = \prod_{\langle i,j\rangle_\alpha\; \in\; p} \sigma_i^{\alpha}\sigma_j^{\alpha} can be composed via multiplication to produce arbitrary contractible loops. As a consequence, we need only to keep track of the value of the flux through each plaquette \phi_i. This relationship between the u_{ij} around a region and the fluxes inside it is evocative of Stokes’ theorem from classical electromagnetism. Indeed, it turns out to be closely related as we shall see later." />
|
||||
<figcaption aria-hidden="true">Figure 3: In the KH model, Wilson loop operators <span class="math inline">\(\hat{W}_p = \prod_{\langle i,j\rangle_\alpha\; \in\; p} \sigma_i^{\alpha}\sigma_j^{\alpha}\)</span> can be composed via multiplication to produce arbitrary contractible loops. As a consequence, we need only to keep track of the value of the flux through each plaquette <span class="math inline">\(\phi_i\)</span>. This relationship between the <span class="math inline">\(u_{ij}\)</span> around a region and the fluxes inside it is evocative of Stokes’ theorem from classical electromagnetism. Indeed, it turns out to be closely related as we shall see later.</figcaption>
|
||||
@ -116,40 +116,40 @@ image:
|
||||
<p>Majorana fermions are something like ‘half of a complex fermion’ and are their own antiparticle. From a set of <span class="math inline">\(N\)</span> fermionic creation <span class="math inline">\(f_i^\dagger\)</span> and anhilation <span class="math inline">\(f_i\)</span> operators, we can construct <span class="math inline">\(2N\)</span> Majorana operators <span class="math inline">\(c_m\)</span>. We can do this construction in multiple ways subject to only mild constraints required to keep the overall commutations relations correct <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>. Majorana operators square to one, but otherwise have standard fermionic anti-commutation relations.</p>
|
||||
<p><span class="math inline">\(N\)</span> spins can be mapped to <span class="math inline">\(N\)</span> fermions with the well-known Jordan-Wigner transformation and indeed this approach can be used to solve the Kitaev model <span class="citation" data-cites="chenExactResultsKitaev2008"> [<a href="#ref-chenExactResultsKitaev2008" role="doc-biblioref">15</a>]</span>. Here I will introduce the method Kitaev used in the original paper as this forms the basis for the results that will be presented in this thesis. Rather than mapping to <span class="math inline">\(N\)</span> fermions, Kitaev maps to <span class="math inline">\(4N\)</span> Majoranas, effectively <span class="math inline">\(2N\)</span> fermions. In contrast to the Jordan-Wigner transformation which makes fermions out of strings of spin operators in order to correctly produce fermionic commutation relations, the Kitaev transformation maps each spin locally to four Majoranas. The downside is that this enlarges the Hilbert space from <span class="math inline">\(2^N\)</span> to <span class="math inline">\(4^N\)</span>. We will have to employ a projector <span class="math inline">\(\hat{P}\)</span> to come back down to the physical Hilbert space later. As everything is local, I will drop the site indices <span class="math inline">\(ijk\)</span> in expressions that refer to only a single site.</p>
|
||||
<p>The mapping is defined in terms of four Majoranas per site <span class="math inline">\(b_i^x,\;b_i^y,\;b_i^z,\;c_i\)</span> such that</p>
|
||||
<p><span id="eq:bg-kh-mapping"><span class="math display">\[\tilde{\sigma}^x = i b^x c,\; \tilde{\sigma}^y = i b^y c,\; \tilde{\sigma}^z = i b^z c\qquad{(2)}\]</span></span></p>
|
||||
<p>The tildes on the spin operators <span class="math inline">\(\tilde{\sigma_i^\alpha}\)</span> emphasise that they live in this new extended Hilbert space and are only equivalent to the original spin operators after applying a projector <span class="math inline">\(\hat{P}\)</span>. The form of the projection operator can be understood in a few ways. From a group-theoretic perspective, before projection, the operators <span class="math inline">\(\{\tilde{\sigma}^x, \tilde{\sigma}^y, \tilde{\sigma}^z\}\)</span> form a representation of the gamma group <span class="math inline">\(G_{3,0}\)</span>. The gamma groups <span class="math inline">\(G_{p,q}\)</span> have <span class="math inline">\(p\)</span> generators that square to the identity and <span class="math inline">\(q\)</span> that square (roughly) to <span class="math inline">\(-1\)</span>. The generators otherwise obey standard anticommutation relations. The well known gamma matrices <span class="math inline">\(\{\gamma^0, \gamma^1, \gamma^2, \gamma^3\}\)</span> represent <span class="math inline">\(G_{1,3}\)</span> the quaternions <span class="math inline">\(G_{0,3}\)</span> and the Pauli matrices <span class="math inline">\(G_{3,0}\)</span>.</p>
|
||||
<p>The Pauli matrices, however, have the additional property that the <em>chiral element</em> <span class="math inline">\(\sigma^x \sigma^y \sigma^z = \pm i\)</span> is not fully determined by the group properties of <span class="math inline">\(G_{3,0}\)</span>, but it is equal to <span class="math inline">\(i\)</span> in the Pauli matrices. Therefore, to fully reproduce the algebra of the Pauli matrices, we must project into the subspace where <span class="math inline">\(\tilde{\sigma}^x \tilde{\sigma}^y \tilde{\sigma}^z = +i\)</span>. The chiral element of the gamma matrices for instance <span class="math inline">\(\gamma_5 = i\gamma^0 \gamma^1 \gamma^2 \gamma^3\)</span> is of central importance in quantum field theory. See <span class="citation" data-cites="petitjeanChiralityDiracSpinors2020"> [<a href="#ref-petitjeanChiralityDiracSpinors2020" role="doc-biblioref">16</a>]</span> for more discussion of this group theoretic view.</p>
|
||||
<p><span id="eq:bg-kh-mapping"><span class="math display">\[\tilde{\sigma}^x = i b^x c,\; \tilde{\sigma}^y = i b^y c,\; \tilde{\sigma}^z = i b^z c.\qquad{(2)}\]</span></span></p>
|
||||
<p>The tildes on the spin operators <span class="math inline">\(\tilde{\sigma_i}^\alpha\)</span> emphasise that they live in this new extended Hilbert space and are only equivalent to the original spin operators after applying a projector <span class="math inline">\(\hat{P}\)</span>. The form of the projection operator can be understood in a few ways. From a group-theoretic perspective, before projection, the operators <span class="math inline">\(\{\tilde{\sigma}^x, \tilde{\sigma}^y, \tilde{\sigma}^z\}\)</span> form a representation of the gamma group <span class="math inline">\(G_{3,0}\)</span>. The gamma groups <span class="math inline">\(G_{p,q}\)</span> have <span class="math inline">\(p\)</span> generators that square to the identity and <span class="math inline">\(q\)</span> that square (roughly) to <span class="math inline">\(-1\)</span>. The generators otherwise obey standard anticommutation relations. The well-known gamma matrices <span class="math inline">\(\{\gamma^0, \gamma^1, \gamma^2, \gamma^3\}\)</span> represent <span class="math inline">\(G_{1,3}\)</span> the quaternions <span class="math inline">\(G_{0,3}\)</span> and the Pauli matrices <span class="math inline">\(G_{3,0}\)</span>.</p>
|
||||
<p>The Pauli matrices, however, have the additional property that the <em>chiral element</em> <span class="math inline">\(\sigma^x \sigma^y \sigma^z = \pm i\)</span> is not fully determined by the group properties of <span class="math inline">\(G_{3,0}\)</span>, but it is equal to <span class="math inline">\(i\)</span> in the Pauli matrices. Therefore, to fully reproduce the algebra of the Pauli matrices, we must project into the subspace where <span class="math inline">\(\tilde{\sigma}^x \tilde{\sigma}^y \tilde{\sigma}^z = +i\)</span>. The chiral element of the gamma matrices for instance <span class="math inline">\(\gamma_5 = i\gamma^0 \gamma^1 \gamma^2 \gamma^3\)</span> is of central importance in quantum field theory. See ref. <span class="citation" data-cites="petitjeanChiralityDiracSpinors2020"> [<a href="#ref-petitjeanChiralityDiracSpinors2020" role="doc-biblioref">16</a>]</span> for more discussion of this group theoretic view.</p>
|
||||
<p>So the projector must project onto the subspace where <span class="math inline">\(\tilde \sigma^x \tilde \sigma^y \tilde \sigma^z = i\)</span>. If we work this through, we find that in general <span class="math inline">\(\tilde \sigma^x \tilde \sigma^y \tilde\sigma^z = iD\)</span> where <span class="math inline">\(D = b^x b^y b^z c\)</span> must be the identity for every site. In other words, we can only work with <em>physical states</em> <span class="math inline">\(|\phi\rangle\)</span> that satisfy <span class="math inline">\(D_i|\phi\rangle = |\phi\rangle\)</span> for all sites <span class="math inline">\(i\)</span>. From this we construct an on-site projector <span class="math inline">\(P_i = \frac{1 + D_i}{2}\)</span> and the overall projector is simply <span class="math inline">\(P = \prod_i P_i\)</span>.</p>
|
||||
<p>Another way to see what this is doing physically is to explicitly construct the two intermediate fermionic operators <span class="math inline">\(f\)</span> and <span class="math inline">\(g\)</span> that give rise to these four Majoranas. Denoting a fermion state by <span class="math inline">\(|n_f, n_g\rangle\)</span> the Hilbert space is the set <span class="math inline">\(\{|00\rangle,|01\rangle,|10\rangle,|11\rangle\}\)</span>. We can map these to Majoranas with, for example, this definition</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
b^x = (f + f^\dagger),\;\;& b^y = -i(f - f^\dagger),\\
|
||||
b^z = (g + g^\dagger),\;\;& c = -i(g - g^\dagger),
|
||||
b^z = (g + g^\dagger),\;\;& c = -i(g - g^\dagger).
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Working through the algebra, we see that the operator <span class="math inline">\(D = b^x b^y b^z c\)</span> is equal to the fermion parity <span class="math inline">\(D = -(2n_f - 1)(2n_g - 1) = \pm1\)</span> where <span class="math inline">\(n_f,\; n_g\)</span> are the number operators. So setting <span class="math inline">\(D = 1\)</span> everywhere is equivalent to restricting to the <span class="math inline">\(\{|01\rangle,|10\rangle\}\)</span> though we could equally well have used <span class="math inline">\(D = -1\)</span>.</p>
|
||||
<p>Expanding the product <span class="math inline">\(\prod_i P_i\)</span> out, we find that the projector corresponds to a symmetrisation over <span class="math inline">\(\{u_{ij}\}\)</span> states within a flux sector and overall fermion parity <span class="math inline">\(\prod_i D_i\)</span>, see <span class="citation" data-cites="pedrocchiPhysicalSolutionsKitaev2011"> [<a href="#ref-pedrocchiPhysicalSolutionsKitaev2011" role="doc-biblioref">17</a>]</span> or <a href="../6_Appendices/A.5_The_Projector.html#app-the-projector">appendix A.6</a> for the full derivation. The significance of this is that an arbitrary many-body state can be made to have non-zero overlap with the physical subspace via the addition or removal of just a single fermion. This implies that, in the thermodynamic limit, the projection step is not generally necessary to extract physical results</p>
|
||||
<p>Working through the algebra, we see that the operator <span class="math inline">\(D = b^x b^y b^z c\)</span> is equal to the fermion parity <span class="math inline">\(D = -(2n_f - 1)(2n_g - 1) = \pm1\)</span> where <span class="math inline">\(n_f,\; n_g\)</span> are the number operators. So setting <span class="math inline">\(D = 1\)</span> everywhere is equivalent to restricting to the states <span class="math inline">\(\{|01\rangle\)</span> and <span class="math inline">\(|10\rangle\}\)</span>, though we could equally well have used <span class="math inline">\(D = -1\)</span>.</p>
|
||||
<p>Expanding the product <span class="math inline">\(\prod_i P_i\)</span> out, we find that the projector corresponds to a symmetrisation over <span class="math inline">\(\{u_{ij}\}\)</span> states within a flux sector and overall fermion parity <span class="math inline">\(\prod_i D_i\)</span>, see ref. <span class="citation" data-cites="pedrocchiPhysicalSolutionsKitaev2011"> [<a href="#ref-pedrocchiPhysicalSolutionsKitaev2011" role="doc-biblioref">17</a>]</span> or <a href="../6_Appendices/A.5_The_Projector.html#app-the-projector">appendix A.6</a> for the full derivation. The significance of this is that an arbitrary many-body state can be made to have non-zero overlap with the physical subspace via the addition or removal of just a single fermion. This implies that, in the thermodynamic limit, the projection step is not generally necessary to extract physical results</p>
|
||||
<p>We can now rewrite the spin hamiltonian in Majorana form with the caveat that they are only strictly equivalent after projection. The Ising interactions <span class="math inline">\(\sigma_j^{\alpha}\sigma_k^{\alpha}\)</span> decouple into the form <span class="math inline">\(-i (i b^\alpha_i b^\alpha_j) c_i c_j\)</span>. We factor out the <em>bond operators</em> <span class="math inline">\(\hat{u}_{ij} = i b^\alpha_i b^\alpha_j\)</span> which are Hermitian and, remarkably, commute with the Hamiltonian and each other.</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\tilde{H} &= - \sum_{\langle i,j\rangle_\alpha} J^{\alpha}\tilde{\sigma}_i^{\alpha}\tilde{\sigma}_j^{\alpha}\\
|
||||
&= i \sum_{\langle i,j\rangle_\alpha} J^{\alpha} \hat{u}_{ij} \hat{c}_i \hat{c}_j
|
||||
&= i \sum_{\langle i,j\rangle_\alpha} J^{\alpha} \hat{u}_{ij} \hat{c}_i \hat{c}_j.
|
||||
\end{aligned}\]</span></p>
|
||||
<p>The bond operators <span class="math inline">\(\hat{u}_{ij}\)</span> square to one so have eigenvalues <span class="math inline">\(\pm1\)</span>. As they’re conserved we will work in their eigenbasis and take off the hats in the Hamiltonian.</p>
|
||||
<p>The bond operators <span class="math inline">\(\hat{u}_{ij}\)</span> square to one so have eigenvalues <span class="math inline">\(\pm1\)</span>. As they are conserved we will work in their eigenbasis and take off the hats in the Hamiltonian.</p>
|
||||
<p><span id="eq:bg-kh-maj-model"><span class="math display">\[\begin{aligned}
|
||||
H &= i \sum_{\langle i,j\rangle_\alpha} J^{\alpha} u_{ij} \hat{c}_i \hat{c}_j
|
||||
H &= i \sum_{\langle i,j\rangle_\alpha} J^{\alpha} u_{ij} \hat{c}_i \hat{c}_j.
|
||||
\end{aligned}\qquad{(3)}\]</span></span></p>
|
||||
</section>
|
||||
<section id="the-fermion-problem" class="level2">
|
||||
<h2>The Fermion Problem</h2>
|
||||
<p>We now have a quadratic Hamiltonian, eq. <a href="#eq:bg-kh-maj-model">3</a>, coupled to a classical field <span class="math inline">\(u_{ij}\)</span>. What follows is relatively standard theory for quadratic Hamiltonians <span class="citation" data-cites="BlaizotRipka1986"> [<a href="#ref-BlaizotRipka1986" role="doc-biblioref">18</a>]</span>.</p>
|
||||
<p>Because of the antisymmetry <span class="math inline">\(J^{\alpha} u_{ij}\)</span>, the eigenvalues of eq. <a href="#eq:bg-kh-maj-model">3</a> come in pairs <span class="math inline">\(\pm \epsilon_m\)</span>. We organise the eigenmodes of <span class="math inline">\(H\)</span> into pairs, such that <span class="math inline">\(b_m\)</span> and <span class="math inline">\(b_m'\)</span> have energies <span class="math inline">\(\epsilon_m\)</span> and <span class="math inline">\(-\epsilon_m\)</span>. The transformation <span class="math inline">\(Q\)</span></p>
|
||||
<p><span class="math display">\[(c_1, c_2... c_{2N}) Q = (b_1, b_1', b_2, b_2' ... b_{N}, b_{N}')\]</span></p>
|
||||
<p><span class="math display">\[(c_1, c_2... c_{2N}) Q = (b_1, b_1', b_2, b_2' ... b_{N}, b_{N}'),\]</span></p>
|
||||
<p>puts the Hamiltonian into normal mode form</p>
|
||||
<p><span class="math display">\[H = \frac{i}{2} \sum_m \epsilon_m b_m b_m'.\]</span></p>
|
||||
<p>The determinant of <span class="math inline">\(Q\)</span> appears when evaluating the projector explicitly, otherwise, the <span class="math inline">\(b_m\)</span> are merely an intermediate step. From them, we form fermionic operators</p>
|
||||
<p><span class="math display">\[ f_i = \tfrac{1}{2} (b_m + ib_m')\]</span></p>
|
||||
<p><span class="math display">\[ f_i = \tfrac{1}{2} (b_m + ib_m'),\]</span></p>
|
||||
<p>with their associated number operators <span class="math inline">\(n_i = f^\dagger_i f_i\)</span>. These let us write the Hamiltonian neatly as</p>
|
||||
<p><span class="math display">\[ H = \sum_m \epsilon_m (n_m - \tfrac{1}{2}).\]</span></p>
|
||||
<p>The energy of the ground state <span class="math inline">\(|n_m = 0\rangle\)</span> of the many-body system at fixed <span class="math inline">\(\{u_{ij}\}\)</span> is</p>
|
||||
<p><span class="math display">\[E_{0} = -\frac{1}{2}\sum_m \epsilon_m \]</span></p>
|
||||
<p><span class="math display">\[E_{0} = -\frac{1}{2}\sum_m \epsilon_m, \]</span></p>
|
||||
<p>and we can construct any state from a particular choice of <span class="math inline">\(n_m = 0,1\)</span>. If we only care about the ground state energy <span class="math inline">\(E_{0}\)</span>, it is possible to skip forming the fermionic operators. The eigenvalues obtained directly from diagonalising <span class="math inline">\(J^{\alpha} u_{ij}\)</span> come in <span class="math inline">\(\pm \epsilon_m\)</span> pairs. We can take half the absolute value of the set to recover <span class="math inline">\(\sum_m \epsilon_m\)</span> directly.</p>
|
||||
</section>
|
||||
<section id="an-emergent-gauge-field" class="level2">
|
||||
@ -160,10 +160,10 @@ H &= i \sum_{\langle i,j\rangle_\alpha} J^{\alpha} u_{ij} \hat{c}_i \hat{c}
|
||||
<img src="/assets/thesis/amk_chapter/intro/gauge_symmetries/gauge_symmetries.svg" id="fig-gauge_symmetries" data-short-caption="Gauge Symmetries" style="width:100.0%" alt="Figure 4: A honeycomb lattice with edges in grey, along with its dual, the triangle lattice in red. The vertices of the dual lattice are the faces of the original lattice and, hence, are the locations of the vortices. (Left) The action of the gauge operator D_j at a vertex is to flip the value of the three u_{jk} variables (black lines) surrounding site j. The corresponding edges of the dual lattice (red lines) form a closed triangle. (Middle) Composing multiple adjacent D_j operators produces a large, closed dual loop or multiple disconnected dual loops. Dual loops are not directed like Wilson loops. (Right) A non-contractable loop which cannot be produced by composing D_j operators. All three operators can be thought of as the action of a vortex-vortex pair that is created, one of them is transported around the loop, and then the two annihilate again. Note that every plaquette has an even number of u_{ij}s flipped on its edge. Therefore, all retain the same flux \phi_i." />
|
||||
<figcaption aria-hidden="true">Figure 4: A honeycomb lattice with edges in grey, along with its dual, the triangle lattice in red. The vertices of the dual lattice are the faces of the original lattice and, hence, are the locations of the vortices. (Left) The action of the gauge operator <span class="math inline">\(D_j\)</span> at a vertex is to flip the value of the three <span class="math inline">\(u_{jk}\)</span> variables (black lines) surrounding site <span class="math inline">\(j\)</span>. The corresponding edges of the dual lattice (red lines) form a closed triangle. (Middle) Composing multiple adjacent <span class="math inline">\(D_j\)</span> operators produces a large, closed dual loop or multiple disconnected dual loops. Dual loops are not directed like Wilson loops. (Right) A non-contractable loop which cannot be produced by composing <span class="math inline">\(D_j\)</span> operators. All three operators can be thought of as the action of a vortex-vortex pair that is created, one of them is transported around the loop, and then the two annihilate again. Note that every plaquette has an even number of <span class="math inline">\(u_{ij}\)</span>s flipped on its edge. Therefore, all retain the same flux <span class="math inline">\(\phi_i\)</span>.</figcaption>
|
||||
</figure>
|
||||
<p>In addition, the bond operators form a highly degenerate description of the system. The operators <span class="math inline">\(D_i = b^x_i b^y_i b^z_i c_i\)</span> commute with <span class="math inline">\(H\)</span> forming a set of local symmetries. The action of <span class="math inline">\(D_i\)</span> on a state is to flip the values of the three <span class="math inline">\(u_{ij}\)</span> bonds that connect to site <span class="math inline">\(i\)</span>. This changes the bond configuration <span class="math inline">\(\{u_{ij}\}\)</span> but leaves the flux configuration <span class="math inline">\(\{\phi_i\}\)</span> unchanged. Physically, we interpret <span class="math inline">\(u_{ij}\)</span> as a gauge field with a high degree of degeneracy and <span class="math inline">\(\{D_i\}\)</span> as the set of gauge symmetries. The Majorana bond operators <span class="math inline">\(u_{ij}\)</span> are an emergent, classical, <span class="math inline">\(\mathbb{Z}_2\)</span> gauge field! The flux configuration <span class="math inline">\(\{\phi_i\}\)</span> is what encodes physical information about the system without all the gauge degeneracy.</p>
|
||||
<p>In addition, the bond operators form a highly degenerate description of the system. The operators <span class="math inline">\(D_i = b^x_i b^y_i b^z_i c_i\)</span> commute with <span class="math inline">\(H\)</span> forming a set of local symmetries. The action of <span class="math inline">\(D_i\)</span> on a state is to flip the values of the three <span class="math inline">\(u_{ij}\)</span> bonds that connect to site <span class="math inline">\(i\)</span>. This changes the bond configuration <span class="math inline">\(\{u_{ij}\}\)</span> but leaves the flux configuration <span class="math inline">\(\{\phi_i\}\)</span> unchanged. Physically, we interpret <span class="math inline">\(u_{ij}\)</span> as a gauge field with a high degree of degeneracy and <span class="math inline">\(\{D_i\}\)</span> as the set of gauge symmetries. Products of the gauge symmetries correspond to closed loops on the dual lattice, see fig. <a href="#fig:gauge_symmetries">4</a>. The Majorana bond operators <span class="math inline">\(u_{ij}\)</span> are an emergent, classical, <span class="math inline">\(\mathbb{Z}_2\)</span> gauge field! The flux configuration <span class="math inline">\(\{\phi_i\}\)</span> is what encodes physical information about the system without all the gauge degeneracy.</p>
|
||||
<p>The ground state of the KH model is the flux configuration where all fluxes are one <span class="math inline">\(\{\phi_i = +1\; \forall \; i\}\)</span>. This can be proven via Lieb’s theorem <span class="citation" data-cites="lieb_flux_1994"> [<a href="#ref-lieb_flux_1994" role="doc-biblioref">19</a>]</span> which gives the lowest energy magnetic flux configuration for a system of electrons hopping in a magnetic field. Kitaev remarks in his original paper that he was not initially aware of the relevance of Lieb’s 1994 result. This is not surprising because at first glance the two models seem quite different. Yet the connection is quite instructive for understanding the KH and its generalisations.</p>
|
||||
<p>Lieb discussed a model of mobile electrons</p>
|
||||
<p><span class="math display">\[H = \sum_{ij} t_{ij} c^\dagger_i c_j\]</span></p>
|
||||
<p><span class="math display">\[H = \sum_{ij} t_{ij} c^\dagger_i c_j,\]</span></p>
|
||||
<p>where the hopping terms <span class="math inline">\(t_{ij} = |t_{ij}|\exp(i\theta_{ij})\)</span> incorporate Aharanhov-Bohm (AB) phases <span class="citation" data-cites="aharonovSignificanceElectromagneticPotentials1959"> [<a href="#ref-aharonovSignificanceElectromagneticPotentials1959" role="doc-biblioref">20</a>]</span> <span class="math inline">\(\theta_{ij}\)</span>. The AB phases model the effect of a slowly varying magnetic field on the electrons through the integral of the magnetic vector potential</p>
|
||||
<p><span class="math display">\[\theta_{ij} = \int_i^j \vec{A} \cdot d\vec{l},\]</span></p>
|
||||
<p>a Peierls substitution <span class="citation" data-cites="peierlsZurTheorieDiamagnetismus1933"> [<a href="#ref-peierlsZurTheorieDiamagnetismus1933" role="doc-biblioref">21</a>]</span>. If we map the Majorana form of the Kitaev model to Lieb’s model, we see that our <span class="math inline">\(t_{ij} = i J^\alpha u_{ij}\)</span>. The <span class="math inline">\(i u_{ij} = \pm i\)</span> correspond to AB phases <span class="math inline">\(\theta_{ij} = \pi/2\)</span> or <span class="math inline">\(3\pi/2\)</span> along each bond.</p>
|
||||
@ -177,7 +177,7 @@ H &= i \sum_{\langle i,j\rangle_\alpha} J^{\alpha} u_{ij} \hat{c}_i \hat{c}
|
||||
&= \prod_{\mathcal{P}_i} \exp{(i\theta_{jk}})\\
|
||||
&= \exp \left( i \sum_{\mathcal{P}_i} \theta_{jk} \right)\\
|
||||
&= \exp \left( i \oint_{\mathcal{P}_i} \vec{A} \cdot d\vec{l} \right)\\
|
||||
&= \exp \left( i Q_i \right)
|
||||
&= \exp \left( i Q_i \right).
|
||||
\end{aligned}\qquad{(5)}\]</span></span></p>
|
||||
<p>Thus, we can interpret the fluxes <span class="math inline">\(\phi_i\)</span> as the exponential of magnetic fluxes <span class="math inline">\(Q_m\)</span> of some fictitious gauge field <span class="math inline">\(\vec{A}\)</span> and the bond operators as <span class="math inline">\(i u_{ij} = \exp i \int_i^j \vec{A} \cdot d\vec{l}\)</span>. In this analogy to classical electromagnetism, the sets <span class="math inline">\(\{u_{ij}\}\)</span> that correspond to the same <span class="math inline">\(\{\phi_i\}\)</span> are all gauge equivalent as we have already seen via other means. The fact that fluxes can be written as products of bond operators and composed is a consequence of eq. <a href="#eq:flux-magnetic">5</a>. If the lattice contains odd plaquettes, as in the Yao-Kivelson model <span class="citation" data-cites="yaoExactChiralSpin2007"> [<a href="#ref-yaoExactChiralSpin2007" role="doc-biblioref">26</a>]</span>, the complex fluxes that appear are a sign that chiral symmetry has been broken.</p>
|
||||
<p>In full, Lieb’s theorem states that the ground state has magnetic flux <span class="math inline">\(Q_i = \sum_{\mathcal{P}_i}\theta_{ij} = \pi \; (\mathrm{mod} \;2\pi)\)</span> for plaquettes with <span class="math inline">\(0 \; (\mathrm{mod}\;4)\)</span> sides and <span class="math inline">\(0 \; (\mathrm{mod}\;2\pi)\)</span> for plaquettes with <span class="math inline">\(2 \; (\mathrm{mod}\;4)\)</span> sides. In terms of our fluxes, this means <span class="math inline">\(\phi = -1\)</span> for squares, <span class="math inline">\(\phi = 1\)</span> for hexagons and so on.</p>
|
||||
@ -195,7 +195,7 @@ A honeycomb lattice (in black) along with its dual (in red). (Left) The product
|
||||
<img src="/assets/thesis/amk_chapter/topological_fluxes.png" id="fig-topological_fluxes" data-short-caption="Topological Fluxes" style="width:57.0%" alt="Figure 6: Wilson loops that wind the major or minor diameters of the torus measure flux winding through the hole of the doughnut/torus (red) or through the filling (green). If they made doughnuts which had both had a jam filling and a hole, this analogy would be a lot easier to make [27]." />
|
||||
<figcaption aria-hidden="true">Figure 6: Wilson loops that wind the major or minor diameters of the torus measure flux winding through the hole of the doughnut/torus (red) or through the filling (green). If they made doughnuts which had both had a jam filling and a hole, this analogy would be a lot easier to make <span class="citation" data-cites="parkerWhyDoesThis"> [<a href="#ref-parkerWhyDoesThis" role="doc-biblioref">27</a>]</span>.</figcaption>
|
||||
</figure>
|
||||
<p>A final but important point to mention is that the local fluxes <span class="math inline">\(\phi_i\)</span> are not quite all there is. We’ve seen that products of <span class="math inline">\(\phi_i\)</span> can be used to construct the flux associated with arbitrary contractible loops. On the plane contractible loops are all there is. However, on the torus we can construct two global fluxes <span class="math inline">\(\Phi_x\)</span> and <span class="math inline">\(\Phi_y\)</span> which correspond to paths tracing the major and minor axes. The four sectors spanned by the <span class="math inline">\(\pm1\)</span> values of these fluxes are gapped away from one another, but only by virtual tunnelling processes so the gap decays exponentially with system size <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>. Physically <span class="math inline">\(\Phi_x\)</span> and <span class="math inline">\(\Phi_y\)</span> could be thought of as measuring the flux that threads through the hole of the doughnut. In general, surfaces with genus <span class="math inline">\(g\)</span> have <span class="math inline">\(g\)</span> ‘handles’ and <span class="math inline">\(2g\)</span> of these global fluxes. At first glance, it may seem that these fluxes would not have much relevance to physical realisations of the Kitaev model which are likely to have a planar geometry. However these fluxes are closely linked to topology and the existence of anyonic quasiparticle excitations in the model, which we will discuss next.</p>
|
||||
<p>A final but important point to mention is that the local fluxes <span class="math inline">\(\phi_i\)</span> are not quite all there is. We’ve seen that products of <span class="math inline">\(\phi_i\)</span> can be used to construct the flux associated with arbitrary contractible loops. On the plane contractible loops are all there is. However, on the torus we can construct two global fluxes <span class="math inline">\(\Phi_x\)</span> and <span class="math inline">\(\Phi_y\)</span> which correspond to paths tracing the major and minor axes. The four sectors spanned by the <span class="math inline">\(\pm1\)</span> values of these fluxes are gapped away from one another, but only by virtual tunnelling processes so the gap decays exponentially with system size <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>. Physically <span class="math inline">\(\Phi_x\)</span> and <span class="math inline">\(\Phi_y\)</span> could be thought of as measuring the flux that threads through the hole of the doughnut. In general, surfaces with genus <span class="math inline">\(g\)</span> have <span class="math inline">\(g\)</span> ‘handles’ and <span class="math inline">\(2g\)</span> of these global fluxes. At first glance, it may seem that these fluxes would not have much relevance to physical realisations of the Kitaev model which are likely to have a planar geometry. However, these fluxes are closely linked to topology and the existence of anyonic quasiparticle excitations in the model, which we will discuss next.</p>
|
||||
</section>
|
||||
<section id="sec:anyons" class="level2">
|
||||
<h2>Anyons, Topology and the Chern number</h2>
|
||||
@ -203,7 +203,7 @@ A honeycomb lattice (in black) along with its dual (in red). (Left) The product
|
||||
<img src="/assets/thesis/amk_chapter/braiding.png" id="fig-braiding" data-short-caption="Braiding in Two Dimensions" style="width:71.0%" alt="Figure 7: Worldlines of particles in 2D can become tangled or braided with one another." />
|
||||
<figcaption aria-hidden="true">Figure 7: Worldlines of particles in 2D can become tangled or <em>braided</em> with one another.</figcaption>
|
||||
</figure>
|
||||
<p>To discuss different ground state phases of the KH model, we must first review the topic of anyons and topology. The standard argument for the existence of Fermions and Bosons goes like this: the quantum state of a system must pick up a factor of <span class="math inline">\(\pm1\)</span> if two identical particles are swapped. Only <span class="math inline">\(\pm1\)</span> are allowed since swapping twice must correspond to the identity. This argument works in 3D for states without topological degeneracy, which seems to be true of the real world, but condensed matter systems are subject to no such constraints.</p>
|
||||
<p>To discuss different ground state phases of the KH model, we must first review the topic of anyons and topology. The standard argument for the existence of fermions and bosons goes like this: the quantum state of a system must pick up a factor of <span class="math inline">\(\pm1\)</span> if two identical particles are swapped. Only <span class="math inline">\(\pm1\)</span> are allowed since swapping twice must correspond to the identity. This argument works in 3D for states without topological degeneracy, which seems to be true of the real world, but condensed matter systems are subject to no such constraints.</p>
|
||||
<p>In gapped condensed matter systems, all equal time correlators decay exponentially with distance <span class="citation" data-cites="hastingsLiebSchultzMattisHigherDimensions2004"> [<a href="#ref-hastingsLiebSchultzMattisHigherDimensions2004" role="doc-biblioref">28</a>]</span>. Put another way, gapped systems support quasiparticles with a definite location in space and finite extent. As such it is meaningful to consider what would happen to the overall quantum state if we were to adiabatically carry out a series of swaps as described above. This is known as braiding. Recently, braiding in topological systems has attracted interest because of proposals to use ground state degeneracy to implement both passively fault tolerant and actively stabilised quantum computations <span class="citation" data-cites="kitaev_fault-tolerant_2003 poulinStabilizerFormalismOperator2005 hastingsDynamicallyGeneratedLogical2021"> [<a href="#ref-kitaev_fault-tolerant_2003" role="doc-biblioref">29</a>–<a href="#ref-hastingsDynamicallyGeneratedLogical2021" role="doc-biblioref">31</a>]</span>.</p>
|
||||
<p>First we realise that, in 2D, swapping identical particles twice is not topologically equivalent to the identity, see fig. <a href="#fig:braiding">7</a>. Instead it corresponds to encircling one particle around the other. This means we can in general pick up any complex phase <span class="math inline">\(e^{i\theta}\)</span> upon exchange, hence the name <em>any</em>-ons. Those that pick up a complex phase are known as Abelian anyons because complex multiplication commutes and hence the group of braiding operations on them forms an Abelian group.</p>
|
||||
<p>The KH model has a topologically degenerate ground state with sectors labelled by the values of the topological fluxes <span class="math inline">\((\Phi_x\)</span>, <span class="math inline">\(\Phi_y)\)</span>. Consider the operation in which a quasiparticle pair is created from the ground state, transported around one of the non-contractible loops, and then annihilated together, call them <span class="math inline">\(\mathcal{T}_{x}\)</span> and <span class="math inline">\(\mathcal{T}_{y}\)</span>. These operations move the system around within the ground state manifold and they need not commute. This leads to non-Abelian anyons. As Kitaev pointed out, these operations are not specific to the torus: the operation <span class="math inline">\(\mathcal{T}_{x}\mathcal{T}_{y}\mathcal{T}_{x}^{-1}\mathcal{T}_{y}^{-1}\)</span> corresponds to an operation in which none of the particles crosses the torus, rather one simply winds around the other. Hence, these effects are relevant even for the planar case!</p>
|
||||
@ -225,7 +225,7 @@ This all works the same way for the amorphous lattice but the diagram is a lot m
|
||||
</figure>
|
||||
<p>Setting the overall energy scale with the constraint <span class="math inline">\(J_x + J_y + J_z = 1\)</span> yields a triangular phase diagram. In each of the corners one of the spin-coupling directions dominates, <span class="math inline">\(|J_\alpha > |J_\beta| + |J_\gamma|\)</span>, yielding three equivalent <span class="math inline">\(A_\alpha\)</span> phases while the central triangle around <span class="math inline">\(J_x = J_y = J_z\)</span> is called the B phase. Both phases support two kinds of quasiparticles, fermions and <span class="math inline">\(\mathbb{Z}_2\)</span>-vortices. In the A phases, the vortices have bosonic statistics with respect to themselves but act like fermions with respect to the fermions, hence they are Abelian anyons. This phase has the same anyonic structure as the Toric code <span class="citation" data-cites="kitaev_fault-tolerant_2003"> [<a href="#ref-kitaev_fault-tolerant_2003" role="doc-biblioref">29</a>]</span>. The B phase can be described as a semi-metal of the Majorana fermions <span class="citation" data-cites="TrebstPhysRep2022"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">5</a>]</span>. Since the B phase is gapless, the quasiparticles aren’t localised and so don’t have braiding statistics.</p>
|
||||
<p>An external magnetic can be used to break chiral symmetry. The lowest order term which breaks chiral symmetry but retains the solvability of the model is the three spin term <span class="math display">\[
|
||||
\sum_{(i,j,k)} \sigma_i^{\alpha} \sigma_j^{\beta} \sigma_k^{\gamma}
|
||||
\sum_{(i,j,k)} \sigma_i^{\alpha} \sigma_j^{\beta} \sigma_k^{\gamma},
|
||||
\]</span> where the sum <span class="math inline">\((i,j,k)\)</span> runs over consecutive indices around plaquettes. The addition of this to the spin model leads to two bond terms in the corresponding Majorana model. The effect of breaking chiral symmetry is to open a gap in the B phase. The vortices of the gapped B phase are non-Abelian anyons. This phase has the same anyonic exchange statistics as <span class="math inline">\(p_x + ip_y\)</span> superconductor <span class="citation" data-cites="readPairedStatesFermions2000"> [<a href="#ref-readPairedStatesFermions2000" role="doc-biblioref">38</a>]</span>, the Moore-Read state for the <span class="math inline">\(\nu = 5/2\)</span> fractional quantum Hall state <span class="citation" data-cites="mooreNonabelionsFractionalQuantum1991"> [<a href="#ref-mooreNonabelionsFractionalQuantum1991" role="doc-biblioref">39</a>]</span> and many other systems <span class="citation" data-cites="aliceaNonAbelianStatisticsTopological2011 fuSuperconductingProximityEffect2008 lutchynMajoranaFermionsTopological2010 oregHelicalLiquidsMajorana2010 sauGenericNewPlatform2010"> [<a href="#ref-aliceaNonAbelianStatisticsTopological2011" role="doc-biblioref">40</a>–<a href="#ref-sauGenericNewPlatform2010" role="doc-biblioref">44</a>]</span>. Collectively these systems have attracted interest as possible physical realisations for quantum computers whose operations are based on braiding operations.</p>
|
||||
<p>At finite temperatures, recent work has shown that the KH model undergoes a transition to a thermal metal phase. Vortex disorder causes the fermion gap to fill up and the DOS has a characteristic logarithmic divergence at zero energy which can be understood from random matrix theory <span class="citation" data-cites="selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">11</a>]</span>.</p>
|
||||
<p>To surmise, the KH model is remarkable because it combines three key properties. First, the form of the Hamiltonian can plausibly be realised by a real material. Candidate materials, such as <span class="math inline">\(\alpha\mathrm{-RuCl}_3\)</span>, are known to have sufficiently strong spin-orbit coupling and the correct lattice structure to behave according to the KH model with small corrections <span class="citation" data-cites="banerjeeProximateKitaevQuantum2016 TrebstPhysRep2022"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">5</a>,<a href="#ref-banerjeeProximateKitaevQuantum2016" role="doc-biblioref">45</a>]</span>. Second, its ground state is the canonical example of the long sought after QSL state, its dynamical spin-spin correlation functions are zero beyond nearest neighbour separation <span class="citation" data-cites="baskaranExactResultsSpin2007"> [<a href="#ref-baskaranExactResultsSpin2007" role="doc-biblioref">46</a>]</span>. Its excitations are anyons, particles that can only exist in 2D that break the normal fermion/boson dichotomy. Third, and perhaps most importantly, this model is a rare many-body interacting quantum system that can be treated analytically. It is exactly solvable. We can explicitly write down its many-body ground states in terms of single particle states <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>. The solubility of the KH model, like the FK model, comes about because the model has extensively many conserved degrees of freedom. These conserved quantities can be factored out as classical degrees of freedom, leaving behind a non-interacting quantum model that is easy to solve.</p>
|
||||
|
@ -88,7 +88,7 @@ H = -t\sum_{\langle jk \rangle} c^\dagger_j c_k + \sum_j V_j c_j^\dagger c_j.
|
||||
<p>Localisation phenomena are strongly dimension dependent. In 3D the scaling theory of localisation <span class="citation" data-cites="edwardsNumericalStudiesLocalization1972 kramerLocalizationTheoryExperiment1993"> [<a href="#ref-edwardsNumericalStudiesLocalization1972" role="doc-biblioref">5</a>,<a href="#ref-kramerLocalizationTheoryExperiment1993" role="doc-biblioref">6</a>]</span> shows that Anderson localisation is a critical phenomenon with critical exponents both for how the conductivity vanishes with energy when approaching the mobility edge and for how the localisation length increases below it. By contrast, in 1D disorder generally dominates. Even the weakest disorder exponentially localises <em>all</em> single particle eigenstates in the 1D Anderson model. Only long-range spatial correlations of the disorder potential can induce delocalisation <span class="citation" data-cites="aubryAnalyticityBreakingAnderson1980 dassarmaLocalizationMobilityEdges1990 dunlapAbsenceLocalizationRandomdimer1990 izrailevLocalizationMobilityEdge1999 croyAndersonLocalization1D2011 izrailevAnomalousLocalizationLowDimensional2012"> [<a href="#ref-aubryAnalyticityBreakingAnderson1980" role="doc-biblioref">7</a>–<a href="#ref-izrailevAnomalousLocalizationLowDimensional2012" role="doc-biblioref">12</a>]</span>.</p>
|
||||
<p>Later localisation was found in disordered interacting many-body systems:</p>
|
||||
<p><span class="math display">\[
|
||||
H = -t\sum_{\langle jk \rangle} c^\dagger_j c_k + \sum_j V_j c_j^\dagger c_j + U\sum_{jk} n_j n_k
|
||||
H = -t\sum_{\langle jk \rangle} c^\dagger_j c_k + \sum_j V_j c_j^\dagger c_j + U\sum_{jk} n_j n_k.
|
||||
\]</span> Here, in contrast to the Anderson model, localisation phenomena are robust to weak perturbations of the Hamiltonian. This is called many-body localisation <span class="citation" data-cites="imbrieManyBodyLocalizationQuantum2016 gogolinEquilibrationThermalisationEmergence2016"> [<a href="#ref-imbrieManyBodyLocalizationQuantum2016" role="doc-biblioref">13</a>,<a href="#ref-gogolinEquilibrationThermalisationEmergence2016" role="doc-biblioref">14</a>]</span>.</p>
|
||||
<p>Both many-body localisation and Anderson localisation depend crucially on the presence of <em>quenched</em> disorder. Quenched disorder takes the form a static background field drawn from an arbitrary probability distribution to which the model is coupled. Disorder may also be introduced into the initial state of the system rather than the Hamiltonian. This has led to ongoing interest in the possibility of disorder-free localisation where the disorder is instead <em>annealed</em>. In this scenario, the disorder necessary to generate localisation is generated entirely from the thermal fluctuations of the model.</p>
|
||||
<p>The concept of disorder-free localisation was first proposed in the context of Helium mixtures <span class="citation" data-cites="kagan1984localization"> [<a href="#ref-kagan1984localization" role="doc-biblioref">15</a>]</span> and then extended to heavy-light mixtures in which multiple species with large mass ratios interact. The idea is that the heavier particles act as an effective disorder potential for the lighter ones, inducing localisation. Two such models <span class="citation" data-cites="yaoQuasiManyBodyLocalizationTranslationInvariant2016 schiulazDynamicsManybodyLocalized2015"> [<a href="#ref-yaoQuasiManyBodyLocalizationTranslationInvariant2016" role="doc-biblioref">16</a>,<a href="#ref-schiulazDynamicsManybodyLocalized2015" role="doc-biblioref">17</a>]</span> instead find that the models thermalise exponentially slowly in system size, which Ref. <span class="citation" data-cites="yaoQuasiManyBodyLocalizationTranslationInvariant2016"> [<a href="#ref-yaoQuasiManyBodyLocalizationTranslationInvariant2016" role="doc-biblioref">16</a>]</span> dubs Quasi-MBL.</p>
|
||||
@ -96,7 +96,7 @@ H = -t\sum_{\langle jk \rangle} c^\dagger_j c_k + \sum_j V_j c_j^\dagger c_j + U
|
||||
<p>In Chapter 3 we will consider a generalised FK model in 1D and study how the disorder generated near a 1D thermodynamic phase transition interacts with localisation physics.</p>
|
||||
<section id="topological-disorder" class="level2">
|
||||
<h2>Topological Disorder</h2>
|
||||
<p>So far we have considered disorder as a static or dynamic field coupled to a model defined on a translation invariant lattice. Another kind of disordered system that is worthy of study are amorphous systems. Amorphous systems have disordered bond connectivity, so called <em>topological disorder</em>. As discussed in the introduction these include amorphous semiconductors such as amorphous Germanium and Silicon <span class="citation" data-cites="Yonezawa1983 zallen2008physics Weaire1971 betteridge1973possible"> [<a href="#ref-Yonezawa1983" role="doc-biblioref">20</a>–<a href="#ref-betteridge1973possible" role="doc-biblioref">23</a>]</span>. While materials do not have long-range lattice structure they can enforce local constraints such as the approximate coordination number <span class="math inline">\(z = 4\)</span> of silicon.</p>
|
||||
<p>So far we have considered disorder as a static or dynamic field coupled to a model defined on a translation invariant lattice. Another kind of disordered system that is worthy of study are amorphous systems. Amorphous systems have disordered bond connectivity, so called <em>topological disorder</em>. As discussed in the introduction these include amorphous semiconductors such as amorphous germanium and silicon <span class="citation" data-cites="Yonezawa1983 zallen2008physics Weaire1971 betteridge1973possible"> [<a href="#ref-Yonezawa1983" role="doc-biblioref">20</a>–<a href="#ref-betteridge1973possible" role="doc-biblioref">23</a>]</span>. While materials do not have long-range lattice structure they can enforce local constraints such as the approximate coordination number <span class="math inline">\(z = 4\)</span> of silicon.</p>
|
||||
<p>Topological disorder can be qualitatively different from other disordered systems. Disordered graphs are constrained by fixed coordination number and the Euler equation. A standard method for generating such graphs with coordination number <span class="math inline">\(d+1\)</span> is Voronoi tessellation <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 marsalTopologicalWeaireThorpeModels2020"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">24</a>,<a href="#ref-marsalTopologicalWeaireThorpeModels2020" role="doc-biblioref">25</a>]</span>. The Harris <span class="citation" data-cites="harrisEffectRandomDefects1974"> [<a href="#ref-harrisEffectRandomDefects1974" role="doc-biblioref">26</a>]</span> and the Imry-Mar <span class="citation" data-cites="imryRandomFieldInstabilityOrdered1975"> [<a href="#ref-imryRandomFieldInstabilityOrdered1975" role="doc-biblioref">27</a>]</span> criteria are key results on the effect of disorder on thermodynamic phase transitions. The Harris criterion signals when disorder will affect the universality of a thermodynamic critical point while the Imry-Ma criterion simply forbids the formation of long-range ordered states in <span class="math inline">\(d \leq 2\)</span> dimensions in the presence of disorder. Both these criteria are modified for the case of topological disorder. This is because the Euler equation and vertex degree constraints lead to strong anti-correlations which mean that topological disorder is effectively weaker than standard disorder in 2D <span class="citation" data-cites="barghathiPhaseTransitionsRandom2014 schrauthViolationHarrisBarghathiVojtaCriterion2018"> [<a href="#ref-barghathiPhaseTransitionsRandom2014" role="doc-biblioref">28</a>,<a href="#ref-schrauthViolationHarrisBarghathiVojtaCriterion2018" role="doc-biblioref">29</a>]</span>. This does not apply to 3D Voronoi lattices where the Euler equation contains an extra volume term and so is effectively a weaker constraint. It is worth exploring how QSLs and disorder interact. The KH model has been studied subject to both flux <span class="citation" data-cites="Nasu_Thermal_2015"> [<a href="#ref-Nasu_Thermal_2015" role="doc-biblioref">30</a>]</span> and bond <span class="citation" data-cites="knolle_dynamics_2016"> [<a href="#ref-knolle_dynamics_2016" role="doc-biblioref">31</a>]</span> disorder. In some instances it seems that disorder can even promote the formation of a QSL ground state <span class="citation" data-cites="wenDisorderedRouteCoulomb2017"> [<a href="#ref-wenDisorderedRouteCoulomb2017" role="doc-biblioref">32</a>]</span>. It has also been shown that the KH model exhibits disorder-free localisation after a quantum quench <span class="citation" data-cites="zhuSubdiffusiveDynamicsCritical2021"> [<a href="#ref-zhuSubdiffusiveDynamicsCritical2021" role="doc-biblioref">33</a>]</span>.</p>
|
||||
<p>In chapter 4 we will put the Kitaev model onto 2D Voronoi lattices and show that much of the rich character of the model is preserved despite the lack of long-range order.</p>
|
||||
</section>
|
||||
@ -111,18 +111,18 @@ H = -t\sum_{\langle jk \rangle} c^\dagger_j c_k + \sum_j V_j c_j^\dagger c_j + U
|
||||
<p>For low dimensional systems with quenched disorder, transfer matrix methods can be used to directly extract the localisation length. These work by turning the time independent Schrödinger equation <span class="math inline">\(\hat{H}|\psi\rangle = E|\psi\rangle\)</span> into a matrix equation linking the amplitude of <span class="math inline">\(\psi\)</span> on each <span class="math inline">\(d-1\)</span> dimensional slice of the system to the next and looking at average properties of this transmission matrix. This method is less useful for systems like the FK model where the disorder as a whole must be sampled from the thermodynamic ensemble.</p>
|
||||
<p>A more versatile method is based on the Inverse Participation Ratio (IPR). The IPR is defined for a normalised wave function <span class="math inline">\(\psi_i = \psi(x_i), \sum_i |\psi_i|^2 = 1\)</span> as its fourth moment <span class="citation" data-cites="kramerLocalizationTheoryExperiment1993"> [<a href="#ref-kramerLocalizationTheoryExperiment1993" role="doc-biblioref">6</a>]</span>:</p>
|
||||
<p><span class="math display">\[
|
||||
P^{-1} = \sum_i |\psi_i|^4
|
||||
P^{-1} = \sum_i |\psi_i|^4.
|
||||
\]</span></p>
|
||||
<p>The name derive from the fact that this operator acts as a measure of the volume where the wavefunction is significantly different from zero. They can alternatively be thought of as providing a measure of the average diameter <span class="math inline">\(R\)</span> from <span class="math inline">\(R = P^{1/d}\)</span>. See fig. <a href="#fig:localisation_radius_vs_length">1</a> for the distinction between <span class="math inline">\(R\)</span> and <span class="math inline">\(\lambda\)</span>.</p>
|
||||
<p>The name derives from the fact that this operator acts as a measure of the volume where the wavefunction is significantly different from zero. They can alternatively be thought of as providing a measure of the average diameter <span class="math inline">\(R\)</span> from <span class="math inline">\(R = P^{1/d}\)</span>. See fig. <a href="#fig:localisation_radius_vs_length">1</a> for the distinction between <span class="math inline">\(R\)</span> and <span class="math inline">\(\lambda\)</span>.</p>
|
||||
<p>For localised states, the <em>inverse</em> participation ratio <span class="math inline">\(P^{-1}\)</span> is independent of system size while for plane wave states in <span class="math inline">\(d\)</span> dimensions <span class="math inline">\(P^{-1} = L^{-d}\)</span>. States may also be intermediate between localised and extended, described by their fractal dimensionality <span class="math inline">\(d > d* > 0\)</span>:</p>
|
||||
<p><span class="math display">\[
|
||||
P(L)^{-1} \sim L^{-d*}
|
||||
P(L)^{-1} \sim L^{-d*}.
|
||||
\]</span></p>
|
||||
<p>Such intermediate states tend to appear as critical phenomena near mobility edges <span class="citation" data-cites="eversAndersonTransitions2008"> [<a href="#ref-eversAndersonTransitions2008" role="doc-biblioref">34</a>]</span>. For finite size systems, these relations only hold once the system size <span class="math inline">\(L\)</span> is much greater than the localisation length. When the localisation length is comparable to the system size, the states still contribute to transport, this is the aforementioned weak localisation effect <span class="citation" data-cites="altshulerMagnetoresistanceHallEffect1980 dattaElectronicTransportMesoscopic1995"> [<a href="#ref-altshulerMagnetoresistanceHallEffect1980" role="doc-biblioref">35</a>,<a href="#ref-dattaElectronicTransportMesoscopic1995" role="doc-biblioref">36</a>]</span>.</p>
|
||||
<p>In the following two chapters I will use an energy resolved IPR <span class="math display">\[
|
||||
\begin{aligned}
|
||||
DOS(\omega) &= \sum_n \delta(\omega - \epsilon_n)\\
|
||||
IPR(\omega) &= DOS(\omega)^{-1} \sum_{n,i} \delta(\omega - \epsilon_n) |\psi_{n,i}|^4
|
||||
IPR(\omega) &= DOS(\omega)^{-1} \sum_{n,i} \delta(\omega - \epsilon_n) |\psi_{n,i}|^4,
|
||||
\end{aligned}
|
||||
\]</span> where <span class="math inline">\(\psi_{n,i}\)</span> is the wavefunction corresponding to the energy <span class="math inline">\(\epsilon_n\)</span> at the ith site. In practice I bin the IPRs into finely spaced bins in energy space and use the mean IPR within each bin.</p>
|
||||
<section id="chapter-summary" class="level3">
|
||||
|
@ -98,11 +98,11 @@ image:
|
||||
<p>We interpret the FK model as a model of spinless fermions, <span class="math inline">\(c^\dagger_{i}\)</span>, hopping on a 1D lattice against a classical Ising spin background, <span class="math inline">\(S_i \in {\pm \frac{1}{2}}\)</span>. The fermions couple to the spins via an onsite interaction with strength <span class="math inline">\(U\)</span> which we supplement by a long-range interaction, <span class="math display">\[
|
||||
J_{ij} = 4\kappa J\; (-1)^{|i-j|} |i-j|^{-\alpha},
|
||||
\]</span></p>
|
||||
<p>between the spins. The additional coupling is very similar to that of the long-range Ising (LRI) model. It stabilises the Antiferromagnetic (AFM) order of the Ising spins which promotes the finite temperature CDW phase of the fermionic sector.</p>
|
||||
<p>The hopping strength of the electrons, <span class="math inline">\(t = 1\)</span>, sets the overall energy scale and we concentrate throughout on the particle-hole symmetric point at zero chemical potential and half filling <span class="citation" data-cites="gruberFalicovKimballModelReview1996"> [<a href="#ref-gruberFalicovKimballModelReview1996" role="doc-biblioref">6</a>]</span>.</p>
|
||||
<p>between the spins, see fig. <a href="#fig:lrfk_schematic">1</a>. The additional coupling is very similar to that of the long-range Ising (LRI) model. It stabilises the antiferromagnetic (AFM) order of the Ising spins which promotes the finite temperature CDW phase of the fermionic sector.</p>
|
||||
<p>The hopping strength of the electrons, <span class="math inline">\(t = 1\)</span>, sets the overall energy scale and we concentrate throughout on the particle-hole symmetric point at zero chemical potential and half-filling <span class="citation" data-cites="gruberFalicovKimballModelReview1996"> [<a href="#ref-gruberFalicovKimballModelReview1996" role="doc-biblioref">6</a>]</span>.</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
H_{\mathrm{FK}} = & \;U \sum_{i} S_i\;(c^\dagger_{i}c_{i} - \tfrac{1}{2}) -\;t \sum_{i} (c^\dagger_{i}c_{i+1} + \textit{h.c.)}\\
|
||||
& + \sum_{i, j}^{N} J_{ij} S_i S_j
|
||||
H_{\mathrm{FK}} = & \;U \sum_{i} S_i\;(c^\dagger_{i}c^{\phantom{\dagger}}_{i} - \tfrac{1}{2}) -\;t \sum_{i} (c^\dagger_{i}c^{\phantom{\dagger}}_{i+1} + \textit{h.c.)}\\
|
||||
& + \sum_{i, j}^{N} J_{ij} S_i S_j.
|
||||
\label{eq:HFK}\end{aligned}\]</span></p>
|
||||
<p>Without proper normalisation, the long-range coupling would render the critical temperature strongly system size dependent for small system sizes. Within a mean field approximation, the critical temperature scales with the effective coupling to all the neighbours of each site, which for a system with <span class="math inline">\(N\)</span> sites is <span class="math inline">\(\sum_{i=1}^{N} i^{-\alpha}\)</span>. Hence, the normalisation <span class="math inline">\(\kappa^{-1} = \sum_{i=1}^{N} i^{-\alpha}\)</span>, renders the critical temperature independent of system size in the mean field approximation. This greatly improves the finite size behaviour of the model.</p>
|
||||
<p>Taking the limit <span class="math inline">\(U = 0\)</span> decouples the spins from the fermions, which gives a spin sector governed by a classical long-range Ising model. Note, the transformation of the spins <span class="math inline">\(S_i \to (-1)^{i} S_i\)</span> maps the AFM model to the FM one. As discussed in the background section, Peierls’ classic argument can be extended to long-range couplings to show that, for the 1D LRI model, a power law decay of <span class="math inline">\(\alpha < 2\)</span> is required for a FTPT. This is because the energy of defect domain scales with the system size when the interactions are long-range and can overcome the entropic contribution. A renormalisation group analysis supports this finding and shows that the critical exponents are only universal for <span class="math inline">\(\alpha \leq 3/2\)</span> <span class="citation" data-cites="ruelleStatisticalMechanicsOnedimensional1968 thoulessLongRangeOrderOneDimensional1969 angeliniRelationsShortrangeLongrange2014"> [<a href="#ref-ruelleStatisticalMechanicsOnedimensional1968" role="doc-biblioref">7</a>–<a href="#ref-angeliniRelationsShortrangeLongrange2014" role="doc-biblioref">9</a>]</span>. In the following, we choose <span class="math inline">\(\alpha = 5/4\)</span> to avoid the additional complexity of non-universal critical points.</p>
|
||||
|
@ -80,51 +80,52 @@ image:
|
||||
<h2>Thermodynamics of the LRFK Model</h2>
|
||||
<figure>
|
||||
<img src="/assets/thesis/fk_chapter/raw_steps_single_flip.svg" id="fig-raw_steps_single_flip" data-short-caption="Comparison of different proposal distributions" style="width:100.0%" alt="Figure 1: Two Markov Chain Monte Carlo (MCMC) walks starting from the CDW state for a system with N = 100 sites and 10,000 MCMC steps but at a temperature close to but above the ordered state (left column) and much higher than it (right column). In this simulation, only a single spin can be flipped per step according to the Metropolis-Hastings Algorithm. The staggered magnetisation m = N^{-1} \sum_i (-1)^i \; S_i order parameter is plotted below. At both temperatures the thermal average of m is zero, while the initial state has m = 1. The higher temperature allows the MCMC to converge more quickly and to fluctuate about the mean with a shorter autocorrelation time. t = 1, \alpha = 1.25, T = {2.5,5}, J = U = 5" />
|
||||
<figcaption aria-hidden="true">Figure 1: Two Markov Chain Monte Carlo (MCMC) walks starting from the CDW state for a system with <span class="math inline">\(N = 100\)</span> sites and 10,000 MCMC steps but at a temperature close to but above the ordered state (left column) and much higher than it (right column). In this simulation, only a single spin can be flipped per step according to the Metropolis-Hastings Algorithm. The staggered magnetisation <span class="math inline">\(m = N^{-1} \sum_i (-1)^i \; S_i\)</span> order parameter is plotted below. At both temperatures the thermal average of m is zero, while the initial state has m = 1. The higher temperature allows the MCMC to converge more quickly and to fluctuate about the mean with a shorter autocorrelation time. <span class="math inline">\(t = 1, \alpha = 1.25, T = {2.5,5}, J = U = 5\)</span></figcaption>
|
||||
<figcaption aria-hidden="true">Figure 1: Two Markov Chain Monte Carlo (MCMC) walks starting from the CDW state for a system with <span class="math inline">\(N = 100\)</span> sites and 10,000 MCMC steps but at a temperature close to but above the ordered state (left column) and much higher than it (right column). In this simulation, only a single spin can be flipped per step according to the Metropolis-Hastings Algorithm. The staggered magnetisation <span class="math inline">\(m = N^{-1} \sum_i (-1)^i \; S_i\)</span> order parameter is plotted below. At both temperatures the thermal average of <span class="math inline">\(m\)</span> is zero, while the initial state has <span class="math inline">\(m = 1\)</span>. The higher temperature allows the MCMC to converge more quickly and to fluctuate about the mean with a shorter autocorrelation time. <span class="math inline">\(t = 1, \alpha = 1.25, T = {2.5,5}, J = U = 5\)</span></figcaption>
|
||||
</figure>
|
||||
<p>A classical Markov Chain Monte Carlo (MCMC) method allows us to solve our LRFK model efficiently, yielding unbiased estimates of thermal expectation values.</p>
|
||||
<p>A classical Markov Chain Monte Carlo (MCMC) method allows us to solve our LRFK model efficiently, yielding unbiased estimates of thermal expectation values, see fig. <a href="#fig:raw_steps_single_flip">1</a>.</p>
|
||||
<p>Since the spin configurations are classical, the LRFK Hamiltonian can be split into a classical spin part <span class="math inline">\(H_s\)</span> and an operator valued part <span class="math inline">\(H_c\)</span>.</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
H_s& = - \frac{U}{2}S_i + \sum_{i, j}^{N} J_{ij} S_i S_j \\
|
||||
H_c& = \sum_i U S_i c^\dagger_{i}c_{i} -t(c^\dagger_{i}c_{i+1} + c^\dagger_{i+1}c_{i}) \end{aligned}\]</span></p>
|
||||
H_c& = \sum_i U S_i c^\dagger_{i}c^{\phantom{\dagger}}_{i} -t(c^\dagger_{i}c^{\phantom{\dagger}}_{i+1} + c^\dagger_{i+1}c^{\phantom{\dagger}}_{i}). \end{aligned}\]</span></p>
|
||||
<p>The partition function can then be written as a sum over spin configurations, <span class="math inline">\(\vec{S} = (S_0, S_1...S_{N-1})\)</span>:</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\mathcal{Z} = \mathrm{Tr} e^{-\beta H}= \sum_{\vec{S}} e^{-\beta H_s} \mathrm{Tr}_c e^{-\beta H_c} .\end{aligned}\]</span></p>
|
||||
<p>The contribution of <span class="math inline">\(H_c\)</span> to the grand canonical partition function can be obtained by performing the sum over eigenstate occupation numbers giving <span class="math inline">\(-\beta F_c[\vec{S}] = \sum_k \ln{(1 + e^{- \beta \epsilon_k})}\)</span> where <span class="math inline">\({\epsilon_k[\vec{S}]}\)</span> are the eigenvalues of the matrix representation of <span class="math inline">\(H_c\)</span> determined through exact diagonalisation. This gives a partition function containing a classical energy which corresponds to the long-range interaction of the spins, and a free energy which corresponds to the quantum subsystem.</p>
|
||||
<p>The contribution of <span class="math inline">\(H_c\)</span> to the grand canonical partition function can be obtained by performing the sum over eigenstate occupation numbers giving <span class="math inline">\(-\beta F_c[\vec{S}] = \sum_k \ln{(1 + e^{- \beta \epsilon_k})}\)</span> where <span class="math inline">\({\epsilon_k[\vec{S}]}\)</span> are the eigenvalues of the matrix representation of <span class="math inline">\(H_c\)</span> determined through exact diagonalisation. This gives a partition function containing a classical energy which corresponds to the long-range interaction of the spins, and a free energy which corresponds to the quantum subsystem</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\mathcal{Z} = \sum_{\vec{S}} e^{-\beta H_S[\vec{S}] - \beta F_c[\vec{S}]} = \sum_{\vec{S}} e^{-\beta E[\vec{S}]}\end{aligned}\]</span></p>
|
||||
\mathcal{Z} = \sum_{\vec{S}} e^{-\beta H_S[\vec{S}] - \beta F_c[\vec{S}]} = \sum_{\vec{S}} e^{-\beta E[\vec{S}]}.\end{aligned}\]</span></p>
|
||||
</section>
|
||||
<section id="markov-chain-monte-carlo-and-emergent-disorder" class="level2">
|
||||
<h2>Markov Chain Monte Carlo and Emergent Disorder</h2>
|
||||
<p>Classical MCMC defines a weighted random walk over the spin states <span class="math inline">\((\vec{S}_0, \vec{S}_1, \vec{S}_2, ...)\)</span>, such that the likelihood of visiting a particular state converges to its Boltzmann probability <span class="math inline">\(p(\vec{S}) = \mathcal{Z}^{-1} e^{-\beta E}\)</span>. Hence, any observable can be estimated as a mean over the states visited by the walk <span class="citation" data-cites="binderGuidePracticalWork1988 kerteszAdvancesComputerSimulation1998 wolffMonteCarloErrors2004"> [<a href="#ref-binderGuidePracticalWork1988" role="doc-biblioref">4</a>–<a href="#ref-wolffMonteCarloErrors2004" role="doc-biblioref">6</a>]</span>,</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\label{eq:thermal_expectation}
|
||||
\langle O \rangle & = \sum_{\vec{S}} p(\vec{S}) \langle O \rangle\\
|
||||
& = \sum_{i = 0}^{M} \langle O\rangle \pm \mathcal{O}(M^{-\tfrac{1}{2}})
|
||||
\langle O \rangle & = \sum_{\vec{S}} p(\vec{S}) \langle O \rangle_{\vec{S}}\\
|
||||
& = \sum_{i = 0}^{M} \langle O\rangle_{\vec{S}_i} \pm \mathcal{O}(M^{-\tfrac{1}{2}}),
|
||||
\end{aligned}\]</span></p>
|
||||
<p>where the former sum runs over the entire state space while the later runs over all the state visited by a particular MCMC run.</p>
|
||||
<p>where the former sum runs over the entire state space while the latter runs over all the states visited by a particular MCMC run,</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\langle O \rangle_{\vec{S}}& = \sum_{\nu} n_F(\epsilon_{\nu}) \langle O \rangle{\nu}
|
||||
\langle O \rangle_{\vec{S}}& = \sum_{\nu} n_F(\epsilon_{\nu}) \langle O \rangle{\nu},
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Where <span class="math inline">\(\nu\)</span> runs over the eigenstates of <span class="math inline">\(H_c\)</span> for a particular spin configuration and <span class="math inline">\(n_F(\epsilon) = \left(e^{-\beta\epsilon} + 1\right)^{-1}\)</span> is the Fermi function.</p>
|
||||
<p>where <span class="math inline">\(\nu\)</span> runs over the eigenstates of <span class="math inline">\(H_c\)</span> for a particular spin configuration and <span class="math inline">\(n_F(\epsilon) = \left(e^{-\beta\epsilon} + 1\right)^{-1}\)</span> is the Fermi function.</p>
|
||||
<p>The choice of the transition function for MCMC is under-determined as one only needs to satisfy a set of balance conditions for which there are many solutions <span class="citation" data-cites="kellyReversibilityStochasticNetworks1981"> [<a href="#ref-kellyReversibilityStochasticNetworks1981" role="doc-biblioref">7</a>]</span>. Here, we incorporate a modification to the standard Metropolis-Hastings algorithm <span class="citation" data-cites="hastingsMonteCarloSampling1970"> [<a href="#ref-hastingsMonteCarloSampling1970" role="doc-biblioref">8</a>]</span> gleaned from Krauth <span class="citation" data-cites="krauthIntroductionMonteCarlo1998"> [<a href="#ref-krauthIntroductionMonteCarlo1998" role="doc-biblioref">9</a>]</span>.</p>
|
||||
<p>The standard algorithm decomposes the transition probability into <span class="math inline">\(\mathcal{T}(a \to b) = p(a \to b)\mathcal{A}(a \to b)\)</span>. Here, <span class="math inline">\(p\)</span> is the proposal distribution, that we can directly sample from, while <span class="math inline">\(\mathcal{A}\)</span> is the acceptance probability. The standard Metropolis-Hastings choice is</p>
|
||||
<p><span class="math display">\[\mathcal{A}(a \to b) = \min\left(1, \frac{p(b\to a)}{p(a\to b)} e^{-\beta \Delta E}\right)\;,\]</span></p>
|
||||
<p>with <span class="math inline">\(\Delta E = E_b - E_a\)</span>. The walk then proceeds by sampling a state <span class="math inline">\(b\)</span> from <span class="math inline">\(p\)</span> and moving to <span class="math inline">\(b\)</span> with probability <span class="math inline">\(\mathcal{A}(a \to b)\)</span>. The latter operation is typically implemented by performing a transition if a uniform random sample from the unit interval is less than <span class="math inline">\(\mathcal{A}(a \to b)\)</span> and otherwise repeating the current state as the next step in the random walk. The proposal distribution is often symmetric, so it does not appear in <span class="math inline">\(\mathcal{A}\)</span>. Here, we flip a small number of sites in <span class="math inline">\(b\)</span> at random to generate proposals, which is a symmetric proposal.</p>
|
||||
<p>In our computations <span class="citation" data-cites="hodsonMCMCFKModel2021"> [<a href="#ref-hodsonMCMCFKModel2021" role="doc-biblioref">10</a>]</span>, we employ a modification to this algorithm based on the observation that the free energy of the FK system is composed of a classical part which is much quicker to compute than the quantum part. Hence, we can obtain a computational speed up by first considering the value of the classical energy difference <span class="math inline">\(\Delta H_s\)</span> and rejecting the transition if the former is too high. We only compute the quantum energy difference <span class="math inline">\(\Delta F_c\)</span> if the transition is accepted. We then perform a second rejection sampling step based upon it. This corresponds to two nested comparisons with the majority of the work only occurring if the first test passes. This modified scheme has the acceptance function <span class="math display">\[\mathcal{A}(a \to b) = \min\left(1, e^{-\beta \Delta H_s}\right)\min\left(1, e^{-\beta \Delta F_c}\right)\;.\]</span></p>
|
||||
<p>In our computations <span class="citation" data-cites="hodsonMCMCFKModel2021"> [<a href="#ref-hodsonMCMCFKModel2021" role="doc-biblioref">10</a>]</span>, we employ a modification to this algorithm based on the observation that the free energy of the FK system is composed of a classical part which is much quicker to compute than the quantum part. Hence, we can obtain a computational speed up by first considering the value of the classical energy difference <span class="math inline">\(\Delta H_s\)</span> and rejecting the transition if the former is too high. We only compute the quantum energy difference <span class="math inline">\(\Delta F_c\)</span> if the transition is accepted. We then perform a second rejection sampling step based upon it. This corresponds to two nested comparisons with the majority of the work only occurring if the first test passes. This modified scheme has the acceptance function</p>
|
||||
<p><span class="math display">\[\mathcal{A}(a \to b) = \min\left(1, e^{-\beta \Delta H_s}\right)\min\left(1, e^{-\beta \Delta F_c}\right).\]</span></p>
|
||||
<p>For the model parameters used, we find that with our new scheme the matrix diagonalisation is skipped around 30% of the time at <span class="math inline">\(T = 2.5\)</span> and up to 80% at <span class="math inline">\(T = 1.5\)</span>. We observe that for <span class="math inline">\(N = 50\)</span>, the matrix diagonalisation, if it occurs, occupies around 60% of the total computation time for a single step. This rises to 90% at N = 300 and further increases for larger N. We therefore get the greatest speedup for large system sizes at low temperature where many prospective transitions are rejected at the classical stage and the matrix computation takes up the greatest fraction of the total computation time. The upshot is that we find a speedup of up to a factor of 10 at the cost of very little extra algorithmic complexity.</p>
|
||||
<p>Our two-step method should be distinguished from the more common method for speeding up MCMC, which is to add asymmetry to the proposal distribution to make it as similar as possible to <span class="math inline">\(\min\left(1, e^{-\beta \Delta E}\right)\)</span>. This reduces the number of rejected states, which brings the algorithm closer in efficiency to a direct sampling method. However, it comes at the expense of requiring a way to directly sample from this complex distribution. This is a problem which MCMC was employed to solve in the first place. For example, recent work trains restricted Boltzmann machines (RBMs) to generate samples for the proposal distribution of the FK model <span class="citation" data-cites="huangAcceleratedMonteCarlo2017"> [<a href="#ref-huangAcceleratedMonteCarlo2017" role="doc-biblioref">11</a>]</span>. The RBMs are chosen as a parametrisation of the proposal distribution that can be efficiently sampled from, while offering sufficient flexibility that they can be adjusted to match the target distribution. Our proposed method is considerably simpler and does not require training while still reaping some of the benefits of reduced computation.</p>
|
||||
</section>
|
||||
<section id="scaling" class="level2">
|
||||
<h2>Scaling</h2>
|
||||
<figure>
|
||||
<img src="/assets/thesis/fk_chapter/binder_cumulants/binder_cumulants.svg" id="fig-binder_cumulants" data-short-caption="Binder Cumulants" style="width:100.0%" alt="Figure 2: (Left) The order parameters, \langle m^2 \rangle(solid) and 1 - f (dashed) describing the onset of the charge density wave phase of the LRFK model at low temperature with staggered magnetisation m = N^{-1} \sum_i (-1)^i S_i and fermionic order parameter f = 2 N^{-1}|\sum_i (-1)^i \; \langle c^\dagger_{i}c_{i}| \rangle . (Right) The crossing of the Binder cumulant, B = \langle m^4 \rangle / \langle m^2 \rangle^2, with system size provides a diagnostic that the phase transition is not a finite size effect, it is used to estimate the critical lines shown in the phase diagram later. All plots use system sizes N = [10,20,30,50,70,110,160,250] and lines are coloured from N = 10 in dark blue to N = 250 in yellow. The parameter values U = 5,\;J = 5,\;\alpha = 1.25 except where explicitly mentioned." />
|
||||
<figcaption aria-hidden="true">Figure 2: (Left) The order parameters, <span class="math inline">\(\langle m^2 \rangle\)</span>(solid) and <span class="math inline">\(1 - f\)</span> (dashed) describing the onset of the charge density wave phase of the LRFK model at low temperature with staggered magnetisation <span class="math inline">\(m = N^{-1} \sum_i (-1)^i S_i\)</span> and fermionic order parameter <span class="math inline">\(f = 2 N^{-1}|\sum_i (-1)^i \; \langle c^\dagger_{i}c_{i}| \rangle\)</span> . (Right) The crossing of the Binder cumulant, <span class="math inline">\(B = \langle m^4 \rangle / \langle m^2 \rangle^2\)</span>, with system size provides a diagnostic that the phase transition is not a finite size effect, it is used to estimate the critical lines shown in the phase diagram later. All plots use system sizes <span class="math inline">\(N = [10,20,30,50,70,110,160,250]\)</span> and lines are coloured from <span class="math inline">\(N = 10\)</span> in dark blue to <span class="math inline">\(N = 250\)</span> in yellow. The parameter values <span class="math inline">\(U = 5,\;J = 5,\;\alpha = 1.25\)</span> except where explicitly mentioned.</figcaption>
|
||||
<img src="/assets/thesis/fk_chapter/binder_cumulants/binder_cumulants.svg" id="fig-binder_cumulants" data-short-caption="Binder Cumulants" style="width:100.0%" alt="Figure 2: (Left) The order parameters, \langle m^2 \rangle (solid) and 1 - f (dashed) describing the onset of the charge density wave phase of the LRFK model at low temperature with staggered magnetisation m = N^{-1} \sum_i (-1)^i S_i and fermionic order parameter f = 2 N^{-1}|\sum_i (-1)^i \; \langle c^\dagger_{i}c^{\phantom{\dagger}}_{i}| \rangle . (Right) The crossing of the Binder cumulant, B = \langle m^4 \rangle / \langle m^2 \rangle^2, with system size provides a diagnostic that the phase transition is not a finite size effect, it is used to estimate the critical lines shown in the phase diagram later. All plots use system sizes N = [10,20,30,50,70,110,160,250] and lines are coloured from N = 10 in dark blue to N = 250 in yellow. The parameter values U = 5,\;J = 5,\;\alpha = 1.25 except where explicitly mentioned." />
|
||||
<figcaption aria-hidden="true">Figure 2: (Left) The order parameters, <span class="math inline">\(\langle m^2 \rangle\)</span> (solid) and <span class="math inline">\(1 - f\)</span> (dashed) describing the onset of the charge density wave phase of the LRFK model at low temperature with staggered magnetisation <span class="math inline">\(m = N^{-1} \sum_i (-1)^i S_i\)</span> and fermionic order parameter <span class="math inline">\(f = 2 N^{-1}|\sum_i (-1)^i \; \langle c^\dagger_{i}c^{\phantom{\dagger}}_{i}| \rangle\)</span> . (Right) The crossing of the Binder cumulant, <span class="math inline">\(B = \langle m^4 \rangle / \langle m^2 \rangle^2\)</span>, with system size provides a diagnostic that the phase transition is not a finite size effect, it is used to estimate the critical lines shown in the phase diagram later. All plots use system sizes <span class="math inline">\(N = [10,20,30,50,70,110,160,250]\)</span> and lines are coloured from <span class="math inline">\(N = 10\)</span> in dark blue to <span class="math inline">\(N = 250\)</span> in yellow. The parameter values <span class="math inline">\(U = 5,\;J = 5,\;\alpha = 1.25\)</span> except where explicitly mentioned.</figcaption>
|
||||
</figure>
|
||||
<p>To improve the scaling of finite size effects, we make the replacement <span class="math inline">\(|i - j|^{-\alpha} \rightarrow |f(i - j)|^{-\alpha}\)</span>, in both <span class="math inline">\(J_{ij}\)</span> and <span class="math inline">\(\kappa\)</span>, where <span class="math inline">\(f(x) = \frac{N}{\pi}\sin \frac{\pi x}{N}\)</span>. <span class="math inline">\(f\)</span> is smooth across the circular boundary and its effect diminished for larger systems <span class="citation" data-cites="fukuiOrderNClusterMonte2009"> [<a href="#ref-fukuiOrderNClusterMonte2009" role="doc-biblioref">12</a>]</span>. We only consider even system sizes given that odd system sizes are not commensurate with a CDW state.</p>
|
||||
<p>To identify critical points, I use the Binder cumulant <span class="math inline">\(U_B\)</span> defined by</p>
|
||||
<p><span class="math display">\[
|
||||
U_B = 1 - \frac{\langle\mu_4\rangle}{3\langle\mu_2\rangle^2}
|
||||
U_B = 1 - \frac{\langle\mu_4\rangle}{3\langle\mu_2\rangle^2},
|
||||
\]</span></p>
|
||||
<p>where <span class="math inline">\(\mu_n = \langle(m - \langle m\rangle)^n\rangle\)</span> are the central moments of the order parameter <span class="math inline">\(m = \sum_i (-1)^i (2n_i - 1) / N\)</span>. The Binder cumulant evaluated against temperature is a diagnostic for the existence of a phase transition. If multiple such curves are plotted for different system sizes, a crossing indicates the location of a critical point while the lines do not cross for systems that don’t have a phase transition in the thermodynamic limit <span class="citation" data-cites="binderFiniteSizeScaling1981 musialMonteCarloSimulations2002"> [<a href="#ref-binderFiniteSizeScaling1981" role="doc-biblioref">2</a>,<a href="#ref-musialMonteCarloSimulations2002" role="doc-biblioref">3</a>]</span>.</p>
|
||||
<p>Next Section: <a href="../3_Long_Range_Falicov_Kimball/3.3_LRFK_Results.html">Results</a></p>
|
||||
|
@ -78,7 +78,7 @@ image:
|
||||
<p>Looking at the results of our MCMC simulations, we find a rich phase diagram with a CDW FTPT and interaction-tuned Anderson versus Mott localised phases similar to the 2D FK model <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016"> [<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">1</a>]</span>. We explore the localisation properties of the fermionic sector and find that the localisation lengths vary dramatically across the phases and for different energies. The results at moderate system sizes indicate the coexistence of localised and delocalised states within the CDW phase. We then introduce a model of uncorrelated binary disorder on a CDW background. This disorder model gives quantitatively similar behaviour to the LRFK model but we are able to simulate it on much larger systems. For these larger system sizes, we find that all states are eventually localised with a localisation length which diverges towards zero temperature indicating that the results at moderate system size suggestive of coexistence were due to weak localisation effects.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/fk_chapter/phase_diagram/phase_diagram.svg" id="fig-phase-diagram-lrfk" data-short-caption="Long-Range Falicov Kimball Model Phase Diagram" style="width:100.0%" alt="Figure 1: Phase diagrams of the 1D long-range Falicov-Kimball model. (Left) The TJ plane at U = 5: the CDW ordered phase is separated from a disordered Mott insulating phase by a critical temperature T_c, linear in J. (Right) On the TU plane at J = 5, the disordered phase is split into two. At large U there is a Mott insulator phase characterised by the presence of a gap at E=0 in the single particle energy spectrum. At small U there is an Anderson phase characterised by the absence of a gap. U_c is independent of temperature indicating that the FTPT is primarily driven by the long-range coupling term in J. At U = 0 the fermions are decoupled from the spins forming a simple Fermi gas." />
|
||||
<figcaption aria-hidden="true">Figure 1: Phase diagrams of the 1D long-range Falicov-Kimball model. (Left) The TJ plane at <span class="math inline">\(U = 5\)</span>: the CDW ordered phase is separated from a disordered Mott insulating phase by a critical temperature <span class="math inline">\(T_c\)</span>, linear in J. (Right) On the TU plane at <span class="math inline">\(J = 5\)</span>, the disordered phase is split into two. At large U there is a Mott insulator phase characterised by the presence of a gap at <span class="math inline">\(E=0\)</span> in the single particle energy spectrum. At small U there is an Anderson phase characterised by the absence of a gap. <span class="math inline">\(U_c\)</span> is independent of temperature indicating that the FTPT is primarily driven by the long-range coupling term in <span class="math inline">\(J\)</span>. At <span class="math inline">\(U = 0\)</span> the fermions are decoupled from the spins forming a simple Fermi gas.</figcaption>
|
||||
<figcaption aria-hidden="true">Figure 1: Phase diagrams of the 1D long-range Falicov-Kimball model. (Left) The TJ plane at <span class="math inline">\(U = 5\)</span>: the CDW ordered phase is separated from a disordered Mott insulating phase by a critical temperature <span class="math inline">\(T_c\)</span>, linear in <span class="math inline">\(J\)</span>. (Right) On the TU plane at <span class="math inline">\(J = 5\)</span>, the disordered phase is split into two. At large <span class="math inline">\(U\)</span> there is a Mott insulator phase characterised by the presence of a gap at <span class="math inline">\(E=0\)</span> in the single particle energy spectrum. At small <span class="math inline">\(U\)</span> there is an Anderson phase characterised by the absence of a gap. <span class="math inline">\(U_c\)</span> is independent of temperature indicating that the FTPT is primarily driven by the long-range coupling term in <span class="math inline">\(J\)</span>. At <span class="math inline">\(U = 0\)</span> the fermions are decoupled from the spins forming a simple Fermi gas.</figcaption>
|
||||
</figure>
|
||||
<section id="lrfk-results-phase-diagram" class="level2">
|
||||
<h2>Phase Diagram</h2>
|
||||
@ -87,15 +87,15 @@ image:
|
||||
<p>The CDW transition temperature is largely independent from the strength of the interaction <span class="math inline">\(U\)</span>. This demonstrates that the phase transition is driven by the long-range term <span class="math inline">\(J\)</span> with little effect from the coupling to the fermions <span class="math inline">\(U\)</span>. The physics of the spin sector in the LRFK model mimics that of the LRI model and is not significantly altered by the presence of the fermions. In 2D the transition to the CDW phase is mediated by an RKYY-like interaction <span class="citation" data-cites="rusinCalculationRKKYRange2017"> [<a href="#ref-rusinCalculationRKKYRange2017" role="doc-biblioref">3</a>]</span>. However, this is insufficient to stabilise long-range order in 1D. That the critical temperature <span class="math inline">\(T_c\)</span> does not depend on <span class="math inline">\(U\)</span> in our model further confirms this.</p>
|
||||
<p>The main order parameter for this model is the staggered magnetisation <span class="math inline">\(m = N^{-1} \sum_i (-1)^i S_i\)</span> that signals the onset of a CDW phase at low temperature. However, my main interest concerns the additional structure of the fermionic sector in the high temperature phase. Following Ref. <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016"> [<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">1</a>]</span>, we can distinguish between the Mott and Anderson insulating phases. The Mott insulator is characterised by a gapped DOS in the absence of a CDW, instead the gap is driven entirely by interaction effect. Thus, the opening of a gap for large <span class="math inline">\(U\)</span> is distinct from the gap-opening induced by the translational symmetry breaking in the CDW state below <span class="math inline">\(T_c\)</span>. The Anderson phase is gapless but, as we explain below, shows localised fermionic eigenstates. It therefore has an insulating character.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/fk_chapter/DOS/DOS.svg" id="fig-DOS" data-short-caption="Energy resolved DOS($\omega$) in the difference phases." style="width:100.0%" alt="Figure 2: Energy resolved DOS(\omega) against system size N in all three phases. The Charge Density Wave (CDW) phase is shown in both the high and low U regime for completeness. The top left panel shows the Anderson phase at U = 2 and high T = 2.5, this phase is gapless, but does not conduct due to Anderson localisation. In the lower left pane at U = 2 and low T = 1.5, CDW order sets in, allowing the single particle eigenstates to become extended but opening a gap in their band structure. In the upper right panel at U = 5 and high T = 2.5, the states are localised by disorder and an interaction driven gap opens. This is a Mott insulator. Finally the CDW phase at U = 5 and T = 1.5 is qualitatively similar to the lower left panel except that the gap scales with U. For all the plots J = 5,\;\alpha = 1.25." />
|
||||
<figcaption aria-hidden="true">Figure 2: Energy resolved DOS(<span class="math inline">\(\omega\)</span>) against system size <span class="math inline">\(N\)</span> in all three phases. The Charge Density Wave (CDW) phase is shown in both the high and low <span class="math inline">\(U\)</span> regime for completeness. The top left panel shows the Anderson phase at <span class="math inline">\(U = 2\)</span> and high <span class="math inline">\(T = 2.5\)</span>, this phase is gapless, but does not conduct due to Anderson localisation. In the lower left pane at <span class="math inline">\(U = 2\)</span> and low <span class="math inline">\(T = 1.5\)</span>, CDW order sets in, allowing the single particle eigenstates to become extended but opening a gap in their band structure. In the upper right panel at <span class="math inline">\(U = 5\)</span> and high <span class="math inline">\(T = 2.5\)</span>, the states are localised by disorder and an interaction driven gap opens. This is a Mott insulator. Finally the CDW phase at <span class="math inline">\(U = 5\)</span> and <span class="math inline">\(T = 1.5\)</span> is qualitatively similar to the lower left panel except that the gap scales with <span class="math inline">\(U\)</span>. For all the plots <span class="math inline">\(J = 5,\;\alpha = 1.25\)</span>.</figcaption>
|
||||
<img src="/assets/thesis/fk_chapter/DOS/DOS.svg" id="fig-DOS" data-short-caption="Energy resolved DOS($\omega$) in the difference phases." style="width:100.0%" alt="Figure 2: Energy resolved DOS(\omega) against system size N in all three phases. The Charge Density Wave (CDW) phase is shown in both the high and low U regime for completeness. The top left panel shows the Anderson phase at U = 2 and high T = 2.5, this phase is gapless, but does not conduct due to Anderson localisation. In the lower left pane at U = 2 and low T = 1.5, CDW order sets in, allowing the single particle eigenstates to become extended but opening a gap in their band structure. In the upper right panel at U = 5 and high T = 2.5, the states are localised by disorder and an interaction driven gap opens. This is a Mott insulator. Finally, the CDW phase at U = 5 and T = 1.5 is qualitatively similar to the lower left panel except that the gap scales with U. For all the plots J = 5,\;\alpha = 1.25." />
|
||||
<figcaption aria-hidden="true">Figure 2: Energy resolved DOS(<span class="math inline">\(\omega\)</span>) against system size <span class="math inline">\(N\)</span> in all three phases. The Charge Density Wave (CDW) phase is shown in both the high and low <span class="math inline">\(U\)</span> regime for completeness. The top left panel shows the Anderson phase at <span class="math inline">\(U = 2\)</span> and high <span class="math inline">\(T = 2.5\)</span>, this phase is gapless, but does not conduct due to Anderson localisation. In the lower left pane at <span class="math inline">\(U = 2\)</span> and low <span class="math inline">\(T = 1.5\)</span>, CDW order sets in, allowing the single particle eigenstates to become extended but opening a gap in their band structure. In the upper right panel at <span class="math inline">\(U = 5\)</span> and high <span class="math inline">\(T = 2.5\)</span>, the states are localised by disorder and an interaction driven gap opens. This is a Mott insulator. Finally, the CDW phase at <span class="math inline">\(U = 5\)</span> and <span class="math inline">\(T = 1.5\)</span> is qualitatively similar to the lower left panel except that the gap scales with <span class="math inline">\(U\)</span>. For all the plots <span class="math inline">\(J = 5,\;\alpha = 1.25\)</span>.</figcaption>
|
||||
</figure>
|
||||
</section>
|
||||
<section id="localisation-properties" class="level2">
|
||||
<h2>Localisation Properties</h2>
|
||||
<figure>
|
||||
<img src="/assets/thesis/fk_chapter/DOS/IPR_scaling.svg" id="fig-IPR_scaling" data-short-caption="Scaling of IPR($\omega$) against system size $N$." style="width:100.0%" alt="Figure 3: The IPR(\omega) scaling with N at fixed energy for each phase and for points both in the gap (\omega_0) and in a band (\omega_1). The slope of the line yields the scaling exponent \tau defined by \mathrm{IPR} \propto N^{-\tau}). \tau close to zero implies that the states at that energy are localised while \tau = -d corresponds to extended states where d is the system dimension. All but the bands of the charge density wave phase are approximately localised with \tau is very close to zero. The bands in the charge density wave phase are localised with long localisation lengths at finite temperatures that extend to infinity as the temperature approaches zero. For all the plots J = 5,\;\alpha = 1.25. The measured \tau_0,\tau_1 for each figure are: (a) (0.06\pm0.01, 0.02\pm0.01 (b) 0.04\pm0.02, 0.00\pm0.01 (c) 0.05\pm0.03, 0.30\pm0.03 (d) 0.06\pm0.04, 0.15\pm0.05 We show later that the apparent slight scaling of the IPR with system size in the localised cases can be explained by finite size effects due to the changing defect density with system size rather than due to delocalisation of the states." />
|
||||
<figcaption aria-hidden="true">Figure 3: The IPR(<span class="math inline">\(\omega\)</span>) scaling with <span class="math inline">\(N\)</span> at fixed energy for each phase and for points both in the gap (<span class="math inline">\(\omega_0\)</span>) and in a band (<span class="math inline">\(\omega_1\)</span>). The slope of the line yields the scaling exponent <span class="math inline">\(\tau\)</span> defined by <span class="math inline">\(\mathrm{IPR} \propto N^{-\tau}\)</span>). <span class="math inline">\(\tau\)</span> close to zero implies that the states at that energy are localised while <span class="math inline">\(\tau = -d\)</span> corresponds to extended states where <span class="math inline">\(d\)</span> is the system dimension. All but the bands of the charge density wave phase are approximately localised with <span class="math inline">\(\tau\)</span> is very close to zero. The bands in the charge density wave phase are localised with long localisation lengths at finite temperatures that extend to infinity as the temperature approaches zero. For all the plots <span class="math inline">\(J = 5,\;\alpha = 1.25\)</span>. The measured <span class="math inline">\(\tau_0,\tau_1\)</span> for each figure are: (a) <span class="math inline">\((0.06\pm0.01, 0.02\pm0.01\)</span> (b) <span class="math inline">\(0.04\pm0.02, 0.00\pm0.01\)</span> (c) <span class="math inline">\(0.05\pm0.03, 0.30\pm0.03\)</span> (d) <span class="math inline">\(0.06\pm0.04, 0.15\pm0.05\)</span> We show later that the apparent slight scaling of the IPR with system size in the localised cases can be explained by finite size effects due to the changing defect density with system size rather than due to delocalisation of the states.</figcaption>
|
||||
<img src="/assets/thesis/fk_chapter/DOS/IPR_scaling.svg" id="fig-IPR_scaling" data-short-caption="Scaling of IPR($\omega$) against system size $N$." style="width:100.0%" alt="Figure 3: The IPR(\omega) scaling with N at fixed energy for each phase and for points both in the gap (\omega_0) and in a band (\omega_1). The slope of the line yields the scaling exponent \tau defined by \mathrm{IPR} \propto N^{-\tau}. \tau close to zero implies that the states at that energy are localised while \tau = -d corresponds to extended states where d is the system dimension. All but the bands of the charge density wave phase are approximately localised with \tau is very close to zero. The bands in the charge density wave phase are localised with long localisation lengths at finite temperatures that extend to infinity as the temperature approaches zero. For all the plots J = 5,\;\alpha = 1.25. The measured \tau_0,\tau_1 for each figure are: (a) 0.06\pm0.01, 0.02\pm0.01 (b) 0.04\pm0.02, 0.00\pm0.01 (c) 0.05\pm0.03, 0.30\pm0.03 (d) 0.06\pm0.04, 0.15\pm0.05 We show later that the apparent slight scaling of the IPR with system size in the localised cases can be explained by finite size effects due to the changing defect density with system size rather than due to delocalisation of the states." />
|
||||
<figcaption aria-hidden="true">Figure 3: The IPR(<span class="math inline">\(\omega\)</span>) scaling with <span class="math inline">\(N\)</span> at fixed energy for each phase and for points both in the gap (<span class="math inline">\(\omega_0\)</span>) and in a band (<span class="math inline">\(\omega_1\)</span>). The slope of the line yields the scaling exponent <span class="math inline">\(\tau\)</span> defined by <span class="math inline">\(\mathrm{IPR} \propto N^{-\tau}\)</span>. <span class="math inline">\(\tau\)</span> close to zero implies that the states at that energy are localised while <span class="math inline">\(\tau = -d\)</span> corresponds to extended states where <span class="math inline">\(d\)</span> is the system dimension. All but the bands of the charge density wave phase are approximately localised with <span class="math inline">\(\tau\)</span> is very close to zero. The bands in the charge density wave phase are localised with long localisation lengths at finite temperatures that extend to infinity as the temperature approaches zero. For all the plots <span class="math inline">\(J = 5,\;\alpha = 1.25\)</span>. The measured <span class="math inline">\(\tau_0,\tau_1\)</span> for each figure are: (a) <span class="math inline">\(0.06\pm0.01, 0.02\pm0.01\)</span> (b) <span class="math inline">\(0.04\pm0.02, 0.00\pm0.01\)</span> (c) <span class="math inline">\(0.05\pm0.03, 0.30\pm0.03\)</span> (d) <span class="math inline">\(0.06\pm0.04, 0.15\pm0.05\)</span> We show later that the apparent slight scaling of the IPR with system size in the localised cases can be explained by finite size effects due to the changing defect density with system size rather than due to delocalisation of the states.</figcaption>
|
||||
</figure>
|
||||
<p>The MCMC formulation suggests viewing the spin configurations as a form of annealed binary disorder whose probability distribution is given by the Boltzmann weight <span class="math inline">\(e^{-\beta H_S[\vec{S}] - \beta F_c[\vec{S}]}\)</span>. This makes apparent the link to the study of disordered systems and Anderson localisation. These systems are typically studied by defining the probability distribution for the quenched disorder potential externally. Here, by contrast, we have a translation invariant system with disorder as a natural consequence of the Ising background field conserved by the dynamics.</p>
|
||||
<figure>
|
||||
@ -104,14 +104,14 @@ image:
|
||||
</figure>
|
||||
<p>In the limits of zero and infinite temperature, our model becomes a simple tight-binding model for the fermions. At zero temperature, the spin background is in one of the two translation invariant AFM ground states with two gapped fermionic CDW bands at energies</p>
|
||||
<p><span class="math display">\[E_{\pm} = \pm\sqrt{\frac{1}{4}U^2 + 2t^2(1 + \cos ka)^2}\;.\]</span></p>
|
||||
<p>At infinite temperature, all the spin configurations become equally likely and the fermionic model reduces to one of binary uncorrelated disorder in which all eigenstates are Anderson localised <span class="citation" data-cites="abrahamsScalingTheoryLocalization1979"> [<a href="#ref-abrahamsScalingTheoryLocalization1979" role="doc-biblioref">4</a>]</span>. An Anderson localised state, centred around <span class="math inline">\(r_0\)</span>, has magnitude that drops exponentially over some localisation length <span class="math inline">\(\xi\)</span> i.e <span class="math inline">\(|\psi(r)|^2 \sim \exp{-|r - r_0|/\xi}\)</span>. Calculating <span class="math inline">\(\xi\)</span> directly is numerically demanding. Therefore, we determine if a given state is localised via the energy-resolved IPR and the DOS defined as</p>
|
||||
<p>At infinite temperature, all the spin configurations become equally likely and the fermionic model reduces to one of binary uncorrelated disorder in which all eigenstates are Anderson localised <span class="citation" data-cites="abrahamsScalingTheoryLocalization1979"> [<a href="#ref-abrahamsScalingTheoryLocalization1979" role="doc-biblioref">4</a>]</span>. An Anderson localised state, centred around <span class="math inline">\(r_0\)</span>, has magnitude that drops exponentially over some localisation length <span class="math inline">\(\xi\)</span> i.e., <span class="math inline">\(|\psi(r)|^2 \sim \exp{-|r - r_0|/\xi}\)</span>. Calculating <span class="math inline">\(\xi\)</span> directly is numerically demanding. Therefore, we determine if a given state is localised via the energy-resolved IPR and the DOS defined as</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\mathrm{DOS}(\vec{S}, \omega)& = N^{-1} \sum_{i} \delta(\epsilon_i - \omega)\\
|
||||
\mathrm{IPR}(\vec{S}, \omega)& = \; N^{-1} \mathrm{DOS}(\vec{S}, \omega)^{-1} \sum_{i,j} \delta(\epsilon_i - \omega)\;\psi^{4}_{i,j}\end{aligned}\]</span></p>
|
||||
\mathrm{IPR}(\vec{S}, \omega)& = \; N^{-1} \mathrm{DOS}(\vec{S}, \omega)^{-1} \sum_{i,j} \delta(\epsilon_i - \omega)\;\psi^{4}_{i,j},\end{aligned}\]</span></p>
|
||||
<p>where <span class="math inline">\(\epsilon_i\)</span> and <span class="math inline">\(\psi_{i,j}\)</span> are the <span class="math inline">\(i\)</span>th energy level and <span class="math inline">\(j\)</span>th element of the corresponding eigenfunction, both dependent on the background spin configuration <span class="math inline">\(\vec{S}\)</span>.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/fk_chapter/gap_opening/gap_opening_U2.svg" id="fig-gap_opening_U2" data-short-caption="The transition from CDW to the Anderson Phase" style="width:100.0%" alt="Figure 5: The DOS (a) and scaling exponent \tau (b) as a function of energy for the CDW phase to the gapless Anderson insulating phase at U=2. Regions where the DOS is close to zero are shown in white. The scaling exponent \tau is obtained from fits to IPR(N) = A N^{-\lambda} for a range of system sizes. J = 5,\;\alpha = 1.25" />
|
||||
<figcaption aria-hidden="true">Figure 5: The DOS (a) and scaling exponent <span class="math inline">\(\tau\)</span> (b) as a function of energy for the CDW phase to the gapless Anderson insulating phase at <span class="math inline">\(U=2\)</span>. Regions where the DOS is close to zero are shown in white. The scaling exponent <span class="math inline">\(\tau\)</span> is obtained from fits to <span class="math inline">\(IPR(N) = A N^{-\lambda}\)</span> for a range of system sizes. <span class="math inline">\(J = 5,\;\alpha = 1.25\)</span></figcaption>
|
||||
<img src="/assets/thesis/fk_chapter/gap_opening/gap_opening_U2.svg" id="fig-gap_opening_U2" data-short-caption="The transition from CDW to the Anderson Phase" style="width:100.0%" alt="Figure 5: The DOS (a) and scaling exponent \tau (b) as a function of energy for the CDW phase to the gapless Anderson insulating phase at U=2. Regions where the DOS is close to zero are shown in white. The scaling exponent \tau is obtained from fits to \mathrm{IPR}(N) = A N^{-\lambda} for a range of system sizes. J = 5,\;\alpha = 1.25" />
|
||||
<figcaption aria-hidden="true">Figure 5: The DOS (a) and scaling exponent <span class="math inline">\(\tau\)</span> (b) as a function of energy for the CDW phase to the gapless Anderson insulating phase at <span class="math inline">\(U=2\)</span>. Regions where the DOS is close to zero are shown in white. The scaling exponent <span class="math inline">\(\tau\)</span> is obtained from fits to <span class="math inline">\(\mathrm{IPR}(N) = A N^{-\lambda}\)</span> for a range of system sizes. <span class="math inline">\(J = 5,\;\alpha = 1.25\)</span></figcaption>
|
||||
</figure>
|
||||
<p>The scaling of the IPR with system size <span class="math inline">\(\mathrm{IPR} \propto N^{-\tau}\)</span> depends on the localisation properties of states at that energy. For delocalised states, e.g. Bloch waves, <span class="math inline">\(\tau\)</span> is the physical dimension. For fully localised states <span class="math inline">\(\tau\)</span> goes to zero in the thermodynamic limit. However, for special types of disorder, such as binary disorder, the localisation lengths can be large, comparable to the system size. This can make it difficult to extract the correct scaling. An additional complication arises from the fact that the scaling exponent may display intermediate behaviours for correlated disorder and in the vicinity of a localisation-delocalisation transition <span class="citation" data-cites="kramerLocalizationTheoryExperiment1993 eversAndersonTransitions2008"> [<a href="#ref-kramerLocalizationTheoryExperiment1993" role="doc-biblioref">5</a>,<a href="#ref-eversAndersonTransitions2008" role="doc-biblioref">6</a>]</span>. The thermal defects of the CDW phase lead to a binary disorder potential with a finite correlation length. The key question for our system is then: How is the <span class="math inline">\(T=0\)</span> CDW phase with fully delocalised fermionic states connected to the fully localised phase at high temperatures?</p>
|
||||
<figure>
|
||||
@ -124,11 +124,11 @@ image:
|
||||
<figcaption aria-hidden="true">Figure 7: A comparison of the full FK model to a simple binary disorder model with a CDW wave background perturbed by uncorrelated defects at density <span class="math inline">\(0 < \rho < 1\)</span> matched to the <span class="math inline">\(\rho\)</span> for the largest corresponding FK model. As in fig. <a href="#fig:IPR_scaling">3</a> <span class="math inline">\(\tau(\omega)\)</span> the scaling of IPR(<span class="math inline">\(\omega\)</span>) with system size, is shown both in gap (<span class="math inline">\(\omega_0\)</span>) and in the band (<span class="math inline">\(\omega_1\)</span>). This data makes clear that the apparent scaling of IPR with system size at small sizes is a finite size effect due to weak localisation <span class="citation" data-cites="antipovInteractionTunedAndersonMott2016"> [<a href="#ref-antipovInteractionTunedAndersonMott2016" role="doc-biblioref">1</a>]</span>. Hence all the states are indeed localised as one would expect in 1D. The disorder model <span class="math inline">\(\tau_0,\tau_1\)</span> for each figure are: (a) <span class="math inline">\(0.01\pm0.05, -0.02\pm0.06\)</span> (b) <span class="math inline">\(0.01\pm0.04, -0.01\pm0.04\)</span> (c) <span class="math inline">\(0.05\pm0.06, 0.04\pm0.06\)</span> (d) <span class="math inline">\(-0.03\pm0.06, 0.01\pm0.06\)</span>. The lines are fit on system sizes <span class="math inline">\(N > 400\)</span></figcaption>
|
||||
</figure>
|
||||
<p>In the CDW phases, at <span class="math inline">\(U=2\)</span> and <span class="math inline">\(U=5\)</span>, we find that states within the gapped CDW bands, e.g. at <span class="math inline">\(\omega_1\)</span>, have scaling exponents <span class="math inline">\(\tau = 0.30\pm0.03\)</span> and <span class="math inline">\(\tau = 0.15\pm0.05\)</span>, respectively. This surprising finding suggests that the CDW bands are partially delocalised with multi-fractal behaviour of the wavefunctions <span class="citation" data-cites="eversAndersonTransitions2008"> [<a href="#ref-eversAndersonTransitions2008" role="doc-biblioref">6</a>]</span>. This phenomenon would be unexpected in a 1D model as they generally do not support delocalisation in the presence of disorder except as the result of correlations in the emergent disorder potential <span class="citation" data-cites="croyAndersonLocalization1D2011 goldshteinPurePointSpectrum1977"> [<a href="#ref-croyAndersonLocalization1D2011" role="doc-biblioref">7</a>,<a href="#ref-goldshteinPurePointSpectrum1977" role="doc-biblioref">8</a>]</span>. However, we show later, by comparison to an uncorrelated Anderson model, that these nonzero exponents are a finite size effect and the states are localised with a finite <span class="math inline">\(\xi\)</span> similar to the system size. This is an example of weak localisation. As a result, the IPR does not scale correctly until the system size has grown much larger than <span class="math inline">\(\xi\)</span>. Fig. <a href="#fig:DM_IPR_scaling">7</a> shows that the scaling of the IPR in the CDW phase does flatten out eventually.</p>
|
||||
<p>Next, we use the DOS and the scaling exponent <span class="math inline">\(\tau\)</span> to explore the localisation properties over the energy-temperature plane in fig. <a href="#fig:gap_opening_U2">5</a> and fig. <a href="#fig:gap_opening_U5">4</a>. Gapped areas are shown in white, which highlights the distinction between the gapped Mott phase and the ungapped Anderson phase. In-gap states appear just below the critical point, smoothly filling the bandgap in the Anderson phase and forming islands in the Mott phase. As in the finite <span class="citation" data-cites="zondaGaplessRegimeCharge2019"> [<a href="#ref-zondaGaplessRegimeCharge2019" role="doc-biblioref">9</a>]</span> and infinite dimensional <span class="citation" data-cites="hassanSpectralPropertiesChargedensitywave2007"> [<a href="#ref-hassanSpectralPropertiesChargedensitywave2007" role="doc-biblioref">10</a>]</span> cases, the in-gap states merge and are pushed to lower energy for decreasing U as the <span class="math inline">\(T=0\)</span> CDW gap closes. Intuitively, the presence of in-gap states can be understood as a result of domain wall fluctuations away from the AFM ordered background. These domain walls act as local potentials for impurity-like bound states <span class="citation" data-cites="zondaGaplessRegimeCharge2019"> [<a href="#ref-zondaGaplessRegimeCharge2019" role="doc-biblioref">9</a>]</span>.</p>
|
||||
<p>To understand the localisation properties we can compare the behaviour of our model with that of a simpler Anderson disorder model in which the spins are replaced by a CDW background with uncorrelated binary defect potentials. This is defined by replacing the spin degree of freedom in the FK model <span class="math inline">\(S_i = \pm \tfrac{1}{2}\)</span> with a disorder potential <span class="math inline">\(d_i = \pm \tfrac{1}{2}\)</span> controlled by a defect density <span class="math inline">\(\rho\)</span> such that <span class="math inline">\(d_i = -\tfrac{1}{2}\)</span> with probability <span class="math inline">\(\rho/2\)</span> and <span class="math inline">\(d_i = \tfrac{1}{2}\)</span> otherwise. <span class="math inline">\(\rho/2\)</span> is used rather than <span class="math inline">\(\rho\)</span> so that the disorder potential takes on the zero temperature CDW ground state at <span class="math inline">\(\rho = 0\)</span> and becomes a random choice over spin states at <span class="math inline">\(\rho = 1\)</span> i.e the infinite temperature limit.</p>
|
||||
<p>Next, we use the DOS and the scaling exponent <span class="math inline">\(\tau\)</span> to explore the localisation properties over the energy-temperature plane in fig. <a href="#fig:gap_opening_U2">5</a> and fig. <a href="#fig:gap_opening_U5">4</a>. Gapped areas are shown in white, which highlights the distinction between the gapped Mott phase and the ungapped Anderson phase. In-gap states appear just below the critical point, smoothly filling the bandgap in the Anderson phase and forming islands in the Mott phase. As in the finite <span class="citation" data-cites="zondaGaplessRegimeCharge2019"> [<a href="#ref-zondaGaplessRegimeCharge2019" role="doc-biblioref">9</a>]</span> and infinite dimensional <span class="citation" data-cites="hassanSpectralPropertiesChargedensitywave2007"> [<a href="#ref-hassanSpectralPropertiesChargedensitywave2007" role="doc-biblioref">10</a>]</span> cases, the in-gap states merge and are pushed to lower energy for decreasing <span class="math inline">\(U\)</span> as the <span class="math inline">\(T=0\)</span> CDW gap closes. Intuitively, the presence of in-gap states can be understood as a result of domain wall fluctuations away from the AFM ordered background. These domain walls act as local potentials for impurity-like bound states <span class="citation" data-cites="zondaGaplessRegimeCharge2019"> [<a href="#ref-zondaGaplessRegimeCharge2019" role="doc-biblioref">9</a>]</span>.</p>
|
||||
<p>To understand the localisation properties we can compare the behaviour of our model with that of a simpler Anderson disorder model in which the spins are replaced by a CDW background with uncorrelated binary defect potentials. This is defined by replacing the spin degree of freedom in the FK model <span class="math inline">\(S_i = \pm \tfrac{1}{2}\)</span> with a disorder potential <span class="math inline">\(d_i = \pm \tfrac{1}{2}\)</span> controlled by a defect density <span class="math inline">\(\rho\)</span> such that <span class="math inline">\(d_i = -\tfrac{1}{2}\)</span> with probability <span class="math inline">\(\rho/2\)</span> and <span class="math inline">\(d_i = \tfrac{1}{2}\)</span> otherwise. <span class="math inline">\(\rho/2\)</span> is used rather than <span class="math inline">\(\rho\)</span> so that the disorder potential takes on the zero temperature CDW ground state at <span class="math inline">\(\rho = 0\)</span> and becomes a random choice over spin states at <span class="math inline">\(\rho = 1\)</span> i.e., the infinite temperature limit.</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
H_{\mathrm{DM}} = & \;U \sum_{i} (-1)^i \; d_i \;(c^\dagger_{i}c_{i} - \tfrac{1}{2}) \\
|
||||
& -\;t \sum_{i} c^\dagger_{i}c_{i+1} + c^\dagger_{i+1}c_{i}
|
||||
H_{\mathrm{DM}} = & \;U \sum_{i} (-1)^i \; d_i \;(c^\dagger_{i}c^{\phantom{\dagger}}_{i} - \tfrac{1}{2}) \\
|
||||
& -\;t \sum_{i} c^\dagger_{i}c^{\phantom{\dagger}}_{i+1} + c^\dagger_{i+1}c^{\phantom{\dagger}}_{i},
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Figures -fig. <a href="#fig:DM_DOS">6</a> and -fig. <a href="#fig:DM_IPR_scaling">7</a> compare the FK model to the disorder model at different system sizes, matching the defect densities of the disorder model to the FK model at <span class="math inline">\(N = 270\)</span> above and below the CDW transition. We find very good, even quantitative, agreement between the FK and disorder models. This suggests that correlations in the spin sector do not play a significant role.</p>
|
||||
<p>As we can sample directly from the disorder model, rather than through MCMC, the samples are uncorrelated. Hence we can evaluate much larger system sizes with the disorder model. This enables us to pin down the correct localisation effects. In particular, what appear to be delocalised states for small system sizes eventually turn out to be states with large localisation length. The localisation length diverges towards the ordered zero temperature CDW state. The interplay of interactions, which here produces a binary potential, and localisation can be very intricate and the added advantage of a 1D model is that we can explore very large system sizes.</p>
|
||||
|
@ -87,13 +87,13 @@ image:
|
||||
<p>The material in this chapter expands on work presented in</p>
|
||||
<p> <span class="citation" data-cites="cassellaExactChiralAmorphous2022"> [<a href="#ref-cassellaExactChiralAmorphous2022" role="doc-biblioref">1</a>]</span> Cassella, G., D’Ornellas, P., Hodson, T., Natori, W. M., & Knolle, J. (2022). An exact chiral amorphous spin liquid. <em>arXiv preprint arXiv:2208.08246.</em></p>
|
||||
<p>All the code is available online as a Python package called Koala <span class="citation" data-cites="hodsonKoalaKitaevAmorphous2022"> [<a href="#ref-hodsonKoalaKitaevAmorphous2022" role="doc-biblioref">2</a>]</span>.</p>
|
||||
<p>This was a joint project of Gino, Peru and me with advice and guidance from Willian and Johannes, all authors of the above. The project grew out of an interest the three of us had in studying amorphous systems, coupled with Johannes’ expertise on the Kitaev model. The idea to use Voronoi partitions came from <span class="citation" data-cites="marsalTopologicalWeaireThorpe2020"> [<a href="#ref-marsalTopologicalWeaireThorpe2020" role="doc-biblioref">3</a>]</span> and Gino did the implementation of this. The idea and implementation of the edge colouring using SAT solvers and the mapping from flux sector to bond sector using A* search were both entirely my work. Peru produced the numerical evidence for the ground state and implemented the local markers. Gino and I did much of the rest of the programming for Koala collaboratively, often pair programming, this included the phase diagram, edge mode and finite temperature analyses as well as the derivation of the projector in the amorphous case.</p>
|
||||
<p>This was a joint project of Gino, Peru and me with advice and guidance from Willian and Johannes, all authors of the above. The project grew out of an interest the three of us had in studying amorphous systems, coupled with Johannes’ expertise on the Kitaev model. The idea to use Voronoi partitions came from ref. <span class="citation" data-cites="marsalTopologicalWeaireThorpe2020"> [<a href="#ref-marsalTopologicalWeaireThorpe2020" role="doc-biblioref">3</a>]</span> and Gino did the implementation of this. The idea and implementation of the edge colouring using SAT solvers and the mapping from flux sector to bond sector using A* search were both entirely my work. Peru produced the numerical evidence for the ground state and implemented the local markers. Gino and I did much of the rest of the programming for Koala collaboratively, often pair programming, this included the phase diagram, edge mode and finite temperature analyses as well as the derivation of the projector in the amorphous case.</p>
|
||||
</section>
|
||||
<section id="ak-summary" class="level3">
|
||||
<h3>Chapter Summary</h3>
|
||||
<p>In this chapter, I will first define the amorphous Kitaev (AK) model and discuss the construction of amorphous lattices. Second, in the <a href="../4_Amorphous_Kitaev_Model/4.2_AMK_Methods.html#amk-methods">methods</a> section I will discuss the details of voronisation and graph colouring. Finally I will present and interpret the <a href="../4_Amorphous_Kitaev_Model/4.3_AMK_Results.html#amk-results">results</a> obtained.</p>
|
||||
<p>In this chapter, I will first define the amorphous Kitaev (AK) model and discuss the construction of amorphous lattices. Second, in the <a href="../4_Amorphous_Kitaev_Model/4.2_AMK_Methods.html#amk-methods">methods</a> section I will discuss the details of voronisation and graph colouring. Finally, I will present and interpret the <a href="../4_Amorphous_Kitaev_Model/4.3_AMK_Results.html#amk-results">results</a> obtained.</p>
|
||||
<p>From its introduction it was known that the Kitaev Honeycomb (KH) model is solvable on any trivalent lattice. Consequently, it has been generalised to many such lattices <span class="citation" data-cites="eschmannThermodynamicClassificationThreedimensional2020 Yao2009 eschmann2019thermodynamics Peri2020"> [<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">4</a>–<a href="#ref-Peri2020" role="doc-biblioref">7</a>]</span> but so far none that entirely lack translation symmetry. Here we will do just that.</p>
|
||||
<p>Amorphous lattices are characterised by local constraints but no long-range order. They arise, for instance, in amorphous semiconductors like Silicon and Germanium <span class="citation" data-cites="Yonezawa1983 zallen2008physics"> [<a href="#ref-Yonezawa1983" role="doc-biblioref">8</a>,<a href="#ref-zallen2008physics" role="doc-biblioref">9</a>]</span>. Recent work has shown that topological insulating (TI) phases, characterised by protected edge states and topological bulk invariants, can exist in amorphous systems <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 agarwala2019topological marsalTopologicalWeaireThorpeModels2020 costa2019toward agarwala2020higher spring2021amorphous corbae2019evidence"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">10</a>–<a href="#ref-corbae2019evidence" role="doc-biblioref">16</a>]</span>. TI phases, however, arise in non-interacting systems. In this context, we might ask whether Quantum Spin Liquid (QSL) systems and the Kitaev Honeycomb (KH) model, in particular, could be realised on amorphous lattices. The phases of the KH model have many similarities with TIs but differ in that the KH model is an interacting system. In general, research on amorphous electronic systems has been focused mainly on non-interacting systems with the exception of amorphous superconductivity <span class="citation" data-cites="buckel1954einfluss mcmillan1981electron meisel1981eliashberg bergmann1976amorphous mannaNoncrystallineTopologicalSuperconductors2022"> [<a href="#ref-buckel1954einfluss" role="doc-biblioref">17</a>–<a href="#ref-mannaNoncrystallineTopologicalSuperconductors2022" role="doc-biblioref">21</a>]</span> or very recent work looking to understand the effect of strong electron repulsion in TIs <span class="citation" data-cites="kim2022fractionalization"> [<a href="#ref-kim2022fractionalization" role="doc-biblioref">22</a>]</span>.</p>
|
||||
<p>Amorphous lattices are characterised by local constraints but no long-range order. They arise, for instance, in amorphous semiconductors like silicon and germanium <span class="citation" data-cites="Yonezawa1983 zallen2008physics"> [<a href="#ref-Yonezawa1983" role="doc-biblioref">8</a>,<a href="#ref-zallen2008physics" role="doc-biblioref">9</a>]</span>. Recent work has shown that topological insulating (TI) phases, characterised by protected edge states and topological bulk invariants, can exist in amorphous systems <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 agarwala2019topological marsalTopologicalWeaireThorpeModels2020 costa2019toward agarwala2020higher spring2021amorphous corbae2019evidence"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">10</a>–<a href="#ref-corbae2019evidence" role="doc-biblioref">16</a>]</span>. TI phases, however, arise in non-interacting systems. In this context, we might ask whether Quantum Spin Liquid (QSL) systems and the Kitaev Honeycomb (KH) model, in particular, could be realised on amorphous lattices. The phases of the KH model have many similarities with TIs but differ in that the KH model is an interacting system. In general, research on amorphous electronic systems has been focused mainly on non-interacting systems with the exception of amorphous superconductivity <span class="citation" data-cites="buckel1954einfluss mcmillan1981electron meisel1981eliashberg bergmann1976amorphous mannaNoncrystallineTopologicalSuperconductors2022"> [<a href="#ref-buckel1954einfluss" role="doc-biblioref">17</a>–<a href="#ref-mannaNoncrystallineTopologicalSuperconductors2022" role="doc-biblioref">21</a>]</span> or very recent work looking to understand the effect of strong electron repulsion in TIs <span class="citation" data-cites="kim2022fractionalization"> [<a href="#ref-kim2022fractionalization" role="doc-biblioref">22</a>]</span>.</p>
|
||||
<p>The KH model is a magnetic system. Magnetism in amorphous systems has been investigated since the 1960s, mostly through the adaptation of theoretical tools developed for disordered systems <span class="citation" data-cites="aharony1975critical Petrakovski1981 kaneyoshi1992introduction Kaneyoshi2018"> [<a href="#ref-aharony1975critical" role="doc-biblioref">23</a>–<a href="#ref-Kaneyoshi2018" role="doc-biblioref">26</a>]</span>. This is not always ideal, we have already seen that the topological disorder of amorphous lattices can be qualitatively different from standard bond or site disorder, especially in 2D <span class="citation" data-cites="barghathiPhaseTransitionsRandom2014 schrauthViolationHarrisBarghathiVojtaCriterion2018"> [<a href="#ref-barghathiPhaseTransitionsRandom2014" role="doc-biblioref">27</a>,<a href="#ref-schrauthViolationHarrisBarghathiVojtaCriterion2018" role="doc-biblioref">28</a>]</span>. Research focused on classical Heisenberg and Ising models has accounted for the observed behaviour of ferromagnetism, disordered antiferromagnetism and widely observed spin glass behaviour <span class="citation" data-cites="coey1978amorphous"> [<a href="#ref-coey1978amorphous" role="doc-biblioref">29</a>]</span>. However, the role of the spin-anisotropic interactions and quantum effects that we see in the KH model has not been addressed in amorphous magnets. It is an open question whether frustrated magnetic interactions on amorphous lattices can give rise to genuine quantum phases such as QSLs <span class="citation" data-cites="Anderson1973 Knolle2019 Savary2016 Lacroix2011"> [<a href="#ref-Anderson1973" role="doc-biblioref">30</a>–<a href="#ref-Lacroix2011" role="doc-biblioref">33</a>]</span>. This chapter will answer that question by demonstrating that the Kitaev model on amorphous lattices leads to a kind of QSL called a chiral spin liquid.</p>
|
||||
<p>In this section, I will discuss how to generalise the KH to an amorphous lattice. The <a href="../4_Amorphous_Kitaev_Model/4.2_AMK_Methods.html#amk-methods">methods section</a> discusses how to generate amorphous lattices using Voronoi partitions of the plane <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 marsalTopologicalWeaireThorpeModels2020"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">10</a>,<a href="#ref-marsalTopologicalWeaireThorpeModels2020" role="doc-biblioref">12</a>]</span>, colour them using a SAT solver and how to map back and forth between gauge field configurations and flux configurations. In the <a href="../4_Amorphous_Kitaev_Model/4.3_AMK_Results.html#amk-results">results section</a>, I will show extensive numerical evidence that the AK model follows the simple generalisation to Lieb’s theorem <span class="citation" data-cites="lieb_flux_1994"> [<a href="#ref-lieb_flux_1994" role="doc-biblioref">34</a>]</span> found by other works <span class="citation" data-cites="eschmannThermodynamicClassificationThreedimensional2020 Yao2009 eschmann2019thermodynamics Peri2020"> [<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">4</a>–<a href="#ref-Peri2020" role="doc-biblioref">7</a>]</span>. I then map out the phase diagram of the AK model and show that the chiral phase around the symmetric point (<span class="math inline">\(J_x = J_y = J_z\)</span>) is gapped and non-Abelian. We use a quantised local Chern number <span class="math inline">\(\nu\)</span> <span class="citation" data-cites="peru_preprint mitchellAmorphousTopologicalInsulators2018"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">10</a>,<a href="#ref-peru_preprint" role="doc-biblioref">35</a>]</span> as well as the presence of protected chiral Majorana edge modes to determine this. Finally, I look at the role of finite temperature fluctuations and show that the proliferation of flux excitations leads to an Anderson transition, similar to that of the Falicov-Kimball model, to a thermal metal phase <span class="citation" data-cites="Laumann2012 lahtinenTopologicalLiquidNucleation2012 selfThermallyInducedMetallic2019"> [<a href="#ref-Laumann2012" role="doc-biblioref">36</a>–<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">38</a>]</span>. Finally, I consider possible physical realisations of the AK model and other generalisations.</p>
|
||||
</section>
|
||||
@ -101,19 +101,19 @@ image:
|
||||
<section id="amk-Model" class="level1">
|
||||
<h1>The Model</h1>
|
||||
<figure>
|
||||
<img src="/assets/thesis/amk_chapter/intro/amk_zoom/amk_zoom_by_hand.svg" id="fig-amk-zoom" data-short-caption="The Kitaev Honeycomb Model" style="width:100.0%" alt="Figure 1: (a) The standard Kitaev model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e. the lattice is trivalent. One of three labels is assigned to each (b). We represent the antisymmetric gauge degree of freedom u_{jk} = \pm 1 with arrows that point in the direction u_{jk} = +1 (c). The Majorana transformation can be visualised as breaking each spin into four Majoranas which then pair along the bonds. The pairs of x,y and z Majoranas become part of the classical \mathbb{Z}_2 gauge field u_{ij}. This leaves a single Majorana c_i per site." />
|
||||
<figcaption aria-hidden="true">Figure 1: <strong>(a)</strong> The standard Kitaev model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e. the lattice is trivalent. One of three labels is assigned to each <strong>(b)</strong>. We represent the antisymmetric gauge degree of freedom <span class="math inline">\(u_{jk} = \pm 1\)</span> with arrows that point in the direction <span class="math inline">\(u_{jk} = +1\)</span> <strong>(c)</strong>. The Majorana transformation can be visualised as breaking each spin into four Majoranas which then pair along the bonds. The pairs of x,y and z Majoranas become part of the classical <span class="math inline">\(\mathbb{Z}_2\)</span> gauge field <span class="math inline">\(u_{ij}\)</span>. This leaves a single Majorana <span class="math inline">\(c_i\)</span> per site.</figcaption>
|
||||
<img src="/assets/thesis/amk_chapter/intro/amk_zoom/amk_zoom_by_hand.svg" id="fig-amk-zoom" data-short-caption="The Kitaev Honeycomb Model" style="width:100.0%" alt="Figure 1: (a) The standard Kitaev model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e., the lattice is trivalent. One of three labels is assigned to each (b). We represent the antisymmetric gauge degree of freedom u_{jk} = \pm 1 with arrows that point in the direction u_{jk} = +1 (c). The Majorana transformation can be visualised as breaking each spin into four Majoranas which then pair along the bonds. Pairs of b_i^x,\;b_i^y and b_i^z Majoranas become part of the classical \mathbb{Z}_2 gauge field u_{ij}. This leaves a single Majorana c_i per site." />
|
||||
<figcaption aria-hidden="true">Figure 1: <strong>(a)</strong> The standard Kitaev model is defined on a honeycomb lattice. The special feature of the honeycomb lattice that makes the model solvable is that each vertex is joined by exactly three bonds, i.e., the lattice is trivalent. One of three labels is assigned to each <strong>(b)</strong>. We represent the antisymmetric gauge degree of freedom <span class="math inline">\(u_{jk} = \pm 1\)</span> with arrows that point in the direction <span class="math inline">\(u_{jk} = +1\)</span> <strong>(c)</strong>. The Majorana transformation can be visualised as breaking each spin into four Majoranas which then pair along the bonds. Pairs of <span class="math inline">\(b_i^x,\;b_i^y\)</span> and <span class="math inline">\(b_i^z\)</span> Majoranas become part of the classical <span class="math inline">\(\mathbb{Z}_2\)</span> gauge field <span class="math inline">\(u_{ij}\)</span>. This leaves a single Majorana <span class="math inline">\(c_i\)</span> per site.</figcaption>
|
||||
</figure>
|
||||
<p>The KH model is solvable on any lattice which satisfies two properties: it must be trivalent and it must three-edge-colourable. The first property means every vertex must have three edges attached to it <span class="citation" data-cites="kitaevAnyonsExactlySolved2006 Nussinov2009"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">39</a>,<a href="#ref-Nussinov2009" role="doc-biblioref">40</a>]</span>. 2D Voronoi lattices are a well studied class of amorphous trivalent lattices <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 florescu_designer_2009 marsalTopologicalWeaireThorpeModels2020"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">10</a>,<a href="#ref-marsalTopologicalWeaireThorpeModels2020" role="doc-biblioref">12</a>,<a href="#ref-florescu_designer_2009" role="doc-biblioref">41</a>]</span>. Given a set of seed points, the Voronoi partition divides the plane into basins, based on which seed point is closest by some metric, usually the euclidean metric. The basins of each seed point form the plaquettes of the resulting lattices, while the boundaries become the edges. The Voronoi partition exists in arbitrary dimension <span class="math inline">\(d\)</span> and produces lattices with degree <span class="math inline">\(d+1\)</span> except for degenerate cases with measure zero <span class="citation" data-cites="voronoiNouvellesApplicationsParamètres1908 watsonComputingNdimensionalDelaunay1981"> [<a href="#ref-voronoiNouvellesApplicationsParamètres1908" role="doc-biblioref">42</a>,<a href="#ref-watsonComputingNdimensionalDelaunay1981" role="doc-biblioref">43</a>]</span>. Voronoi lattices in 2D are trivalent so lend themselves naturally to the Kitaev model.</p>
|
||||
<p>Other methods of lattice generation exist. One can connect randomly placed sites based on proximity <span class="citation" data-cites="agarwala2019topological"> [<a href="#ref-agarwala2019topological" role="doc-biblioref">11</a>]</span> or create simplices from random sites <span class="citation" data-cites="christRandomLatticeField1982"> [<a href="#ref-christRandomLatticeField1982" role="doc-biblioref">44</a>]</span>. However these methods do not present a natural way to restrict the vertex degree to a constant. The most unbiased way to select trivalent graphs would be to sample uniformly from the space of possible trivalent graphs. There has been some work on how to do this using a Markov Chain Monte Carlo approach <span class="citation" data-cites="alyamiUniformSamplingDirected2016"> [<a href="#ref-alyamiUniformSamplingDirected2016" role="doc-biblioref">45</a>]</span>. However, it does not guarantee that the resulting graph is planar, which is necessary to be able to three-edge-colour the lattice, our second constraint.</p>
|
||||
<p>The KH model is solvable on any lattice which satisfies two properties: it must be trivalent and it must three-edge-colourable. The first property means every vertex must have three edges attached to it <span class="citation" data-cites="kitaevAnyonsExactlySolved2006 Nussinov2009"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">39</a>,<a href="#ref-Nussinov2009" role="doc-biblioref">40</a>]</span>. 2D Voronoi lattices are a well-studied class of amorphous trivalent lattices <span class="citation" data-cites="mitchellAmorphousTopologicalInsulators2018 florescu_designer_2009 marsalTopologicalWeaireThorpeModels2020"> [<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">10</a>,<a href="#ref-marsalTopologicalWeaireThorpeModels2020" role="doc-biblioref">12</a>,<a href="#ref-florescu_designer_2009" role="doc-biblioref">41</a>]</span>. Given a set of seed points, the Voronoi partition divides the plane into basins, based on which seed point is closest by some metric, usually the euclidean metric. The basins of each seed point form the plaquettes of the resulting lattices, while the boundaries become the edges. The Voronoi partition exists in arbitrary dimension <span class="math inline">\(d\)</span> and produces lattices with degree <span class="math inline">\(d+1\)</span> except for degenerate cases with measure zero <span class="citation" data-cites="voronoiNouvellesApplicationsParamètres1908 watsonComputingNdimensionalDelaunay1981"> [<a href="#ref-voronoiNouvellesApplicationsParamètres1908" role="doc-biblioref">42</a>,<a href="#ref-watsonComputingNdimensionalDelaunay1981" role="doc-biblioref">43</a>]</span>. Voronoi lattices in 2D are trivalent so lend themselves naturally to the Kitaev model.</p>
|
||||
<p>Other methods of lattice generation exist. One can connect randomly placed sites based on proximity <span class="citation" data-cites="agarwala2019topological"> [<a href="#ref-agarwala2019topological" role="doc-biblioref">11</a>]</span> or create simplices from random sites <span class="citation" data-cites="christRandomLatticeField1982"> [<a href="#ref-christRandomLatticeField1982" role="doc-biblioref">44</a>]</span>. However, these methods do not present a natural way to restrict the vertex degree to a constant. The most unbiased way to select trivalent graphs would be to sample uniformly from the space of possible trivalent graphs. There has been some work on how to do this using a Markov Chain Monte Carlo approach <span class="citation" data-cites="alyamiUniformSamplingDirected2016"> [<a href="#ref-alyamiUniformSamplingDirected2016" role="doc-biblioref">45</a>]</span>. However, it does not guarantee that the resulting graph is planar, which is necessary to be able to three-edge-colour the lattice, our second constraint.</p>
|
||||
<p>The second constraint, three-edge-colourability, requires that we must be able to assign labels to each bond <span class="math inline">\(\{x,y,z\}\)</span> such that no two edges with the same label meet at a vertex. Such an assignment is known as a three-edge-colouring. For translation invariant models we need only find a solution for the unit cell. This problem is usually small enough that this can be done by hand or using symmetry. For amorphous lattices, the difficulty is that, to the best of my knowledge, the problem of edge-colouring these lattices in general is in NP. To find colourings in practice, we will employ a standard method from the computer science literature for finding solutions of NP problems called a SAT solver, this is discussed in more detail in the <a href="../4_Amorphous_Kitaev_Model/4.2_AMK_Methods.html#amk-methods">methods secton</a>.</p>
|
||||
<p>We find that for large lattices there are many valid colourings. In the isotropic case <span class="math inline">\(J^\alpha = 1\)</span> the colouring has no physical significance as the definition of the four Majoranas at a site is arbitrary. In the anisotropic case this symmetry is broken at the local level but we nevertheless expect the lattices to exhibit a self averaging behaviour in larger systems such that the choice of colouring doesn’t matter.</p>
|
||||
<p>We find that for large lattices there are many valid colourings. In the isotropic case <span class="math inline">\(J^\alpha = 1\)</span> the colouring has no physical significance as the definition of the four Majoranas at a site is arbitrary. In the anisotropic case this symmetry is broken at the local level but we nevertheless expect the lattices to exhibit a self-averaging behaviour in larger systems such that the choice of colouring doesn’t matter.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/amk_chapter/intro/state_decomposition_animated/state_decomposition_animated.gif" id="fig-state_decomposition_animated" data-short-caption="State Decomposition" style="width:100.0%" alt="Figure 2: (Bond Sector) A state in the bond sector is specified by assigning \pm 1 to each edge of the lattice. However, this description has a substantial gauge degeneracy. To remove it, we decompose each state into the product of three kinds of objects: (Flux Sector) The main physically relevant quantities. Only a small number of bonds need to be flipped (compared to some arbitrary fixed reference) to reconstruct the flux sector. Here, the edges are chosen from a spanning tree of the dual lattice, so there are no loops. (Gauge Field) The ‘loopiness’ of the bond sector is in this part. This is a network of loops that can always be written as a product of the gauge operators D_j. (Topological Sector) Finally, there are two loops that have no effect on the vortex sector, nor can they be constructed from gauge symmetries D_j. These can be thought of as two fluxes \Phi_{x/y} that thread through the major and minor axes of the torus. Measuring \Phi_{x/y} amounts to constructing Wilson loops around the axes of the torus. We can flip the value of \Phi_{x} by transporting a vortex pair around the torus in the y direction, as shown here. In each of the three figures on the right, black bonds correspond to those that must be flipped, while red line are those same edges on the dual lattice. Composing the three objects together gives back the original bond sector on the left." />
|
||||
<figcaption aria-hidden="true">Figure 2: (Bond Sector) A state in the bond sector is specified by assigning <span class="math inline">\(\pm 1\)</span> to each edge of the lattice. However, this description has a substantial gauge degeneracy. To remove it, we decompose each state into the product of three kinds of objects: (Flux Sector) The main physically relevant quantities. Only a small number of bonds need to be flipped (compared to some arbitrary fixed reference) to reconstruct the flux sector. Here, the edges are chosen from a spanning tree of the dual lattice, so there are no loops. (Gauge Field) The ‘loopiness’ of the bond sector is in this part. This is a network of loops that can always be written as a product of the gauge operators <span class="math inline">\(D_j\)</span>. (Topological Sector) Finally, there are two loops that have no effect on the vortex sector, nor can they be constructed from gauge symmetries <span class="math inline">\(D_j\)</span>. These can be thought of as two fluxes <span class="math inline">\(\Phi_{x/y}\)</span> that thread through the major and minor axes of the torus. Measuring <span class="math inline">\(\Phi_{x/y}\)</span> amounts to constructing Wilson loops around the axes of the torus. We can flip the value of <span class="math inline">\(\Phi_{x}\)</span> by transporting a vortex pair around the torus in the <span class="math inline">\(y\)</span> direction, as shown here. In each of the three figures on the right, black bonds correspond to those that must be flipped, while red line are those same edges on the dual lattice. Composing the three objects together gives back the original bond sector on the left.</figcaption>
|
||||
</figure>
|
||||
<p>On a lattice with the above properties, the solution for the KH model laid out in <a href="../2_Background/2.2_HKM_Model.html#bg-hkm-model">section 2.2</a> remains applicable to our AK model. See fig. <a href="#fig:amk-zoom">1</a> for an example lattice generated by our method. The main differences are twofold. Firstly, the lattices are no longer bipartite in general and therefore contain plaquettes with an odd number of sides which enclose flux <span class="math inline">\(\pm i\)</span>. This leads the AK model to have a ground state with spontaneously broken chiral symmetry <span class="citation" data-cites="Chua2011 yaoExactChiralSpin2007 ChuaPRB2011 Fiete2012 Natori2016 Wu2009 Peri2020 WangHaoranPRB2021"> [<a href="#ref-Peri2020" role="doc-biblioref">7</a>,<a href="#ref-Chua2011" role="doc-biblioref">46</a>–<a href="#ref-WangHaoranPRB2021" role="doc-biblioref">52</a>]</span>. This is analogous to the behaviour of the original Kitaev model in response to a magnetic field. One ground state is related to the other by globally inverting the imaginary <span class="math inline">\(\phi_i\)</span> fluxes <span class="citation" data-cites="yaoExactChiralSpin2007"> [<a href="#ref-yaoExactChiralSpin2007" role="doc-biblioref">47</a>]</span>. Secondly, as the model is no longer translationally invariant, Lieb’s theorem for the ground state flux sector no longer applies. However as discussed in the background, a simple generalisation of Lieb’s theorem has been shown numerically to be applicable to many generalised Kitaev models <span class="citation" data-cites="eschmannThermodynamicClassificationThreedimensional2020 Yao2009 eschmann2019thermodynamics Peri2020"> [<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">4</a>–<a href="#ref-Peri2020" role="doc-biblioref">7</a>]</span>. This generalisation states that the ground state flux configuration depends on the number of sides of each plaquette as</p>
|
||||
<p><span id="eq:gs-flux-sector"><span class="math display">\[\phi = -(\pm i)^{n_{\mathrm{sides}}}\qquad{(1)}\]</span></span></p>
|
||||
<p>On a lattice with the above properties, the solution for the KH model laid out in <a href="../2_Background/2.2_HKM_Model.html#bg-hkm-model">section 2.2</a> remains applicable to our AK model. See fig. <a href="#fig:amk-zoom">1</a> for an example lattice generated by our method. The main differences are twofold. Firstly, the lattices are no longer bipartite in general and therefore contain plaquettes with an odd number of sides which enclose flux <span class="math inline">\(\pm i\)</span>. This leads the AK model to have a ground state with spontaneously broken chiral symmetry <span class="citation" data-cites="Chua2011 yaoExactChiralSpin2007 ChuaPRB2011 Fiete2012 Natori2016 Wu2009 Peri2020 WangHaoranPRB2021"> [<a href="#ref-Peri2020" role="doc-biblioref">7</a>,<a href="#ref-Chua2011" role="doc-biblioref">46</a>–<a href="#ref-WangHaoranPRB2021" role="doc-biblioref">52</a>]</span>. This is analogous to the behaviour of the original Kitaev model in response to a magnetic field. One ground state is related to the other by globally inverting the imaginary <span class="math inline">\(\phi_i\)</span> fluxes <span class="citation" data-cites="yaoExactChiralSpin2007"> [<a href="#ref-yaoExactChiralSpin2007" role="doc-biblioref">47</a>]</span>. Secondly, as the model is no longer translationally invariant, Lieb’s theorem for the ground state flux sector no longer applies. However, as discussed in the background, a simple generalisation of Lieb’s theorem has been shown numerically to be applicable to many generalised Kitaev models <span class="citation" data-cites="eschmannThermodynamicClassificationThreedimensional2020 Yao2009 eschmann2019thermodynamics Peri2020"> [<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">4</a>–<a href="#ref-Peri2020" role="doc-biblioref">7</a>]</span>. This generalisation states that the ground state flux configuration depends on the number of sides of each plaquette as</p>
|
||||
<p><span id="eq:gs-flux-sector"><span class="math display">\[\phi = -(\pm i)^{n_{\mathrm{sides}}},\qquad{(1)}\]</span></span></p>
|
||||
<p>with a twofold global chiral degeneracy (picking either <span class="math inline">\(+i\)</span> or <span class="math inline">\(-i\)</span> in eq. <a href="#eq:gs-flux-sector">1</a>).</p>
|
||||
<p>To verify numerically that Lieb’s theorem generalises to the AK model, the obvious approach would be via exhaustive checking of flux configurations. However, this is problematic because the number of states to check scales exponentially with system size. We side-step this by gluing together two methods, we first work with lattices small enough that we can fully enumerate their flux sectors but tile them to reduce finite size effects. We then show that the effect of tiling scales away with system size.</p>
|
||||
<figure>
|
||||
@ -136,11 +136,11 @@ The same as @fig:flood_fill but for the amorphous lattice.
|
||||
<section id="the-euler-equation" class="level2">
|
||||
<h2>The Euler Equation</h2>
|
||||
<p>Euler’s equation provides a convenient way to understand how the states of the AK model factorise into flux sectors, gauge sectors and topological sectors as in fig. <a href="#fig:state_decomposition_animated">2</a>. The Euler equation states that if we embed a lattice with <span class="math inline">\(B\)</span> bonds, <span class="math inline">\(P\)</span> plaquettes and <span class="math inline">\(V\)</span> vertices onto a closed surface of genus <span class="math inline">\(g\)</span>, (<span class="math inline">\(0\)</span> for the sphere, <span class="math inline">\(1\)</span> for the torus) then</p>
|
||||
<p><span class="math display">\[B = P + V + 2 - 2g\]</span></p>
|
||||
<p><span class="math display">\[B = P + V + 2 - 2g.\]</span></p>
|
||||
<p>For the case of the torus where <span class="math inline">\(g = 1\)</span>, we can rearrange this and exponentiate it to read:</p>
|
||||
<p><span class="math display">\[2^B = 2^{P-1}\cdot 2^{V-1} \cdot 2^2\]</span></p>
|
||||
<p><span class="math display">\[2^B = 2^{P-1}\cdot 2^{V-1} \cdot 2^2.\]</span></p>
|
||||
<p>There are <span class="math inline">\(2^B\)</span> configurations of the bond variables <span class="math inline">\(\{u_{ij}\}\)</span>. Each of these configurations can be uniquely decomposed into a flux sector, a gauge sector and a topological sector, see fig. <a href="#fig:state_decomposition_animated">2</a>. Each of the <span class="math inline">\(P\)</span> plaquette operators <span class="math inline">\(\phi_i\)</span> takes two values but vortices are created in pairs so there are <span class="math inline">\(2^{P-1}\)</span> vortex sectors in total. There are <span class="math inline">\(2^{V-1}\)</span> gauge symmetries formed from the <span class="math inline">\(V\)</span> symmetry operators <span class="math inline">\(D_i\)</span> because <span class="math inline">\(\prod_{j} D_j = \mathbb{I}\)</span> is enforced by the projector. Finally, the two topological fluxes <span class="math inline">\(\Phi_x\)</span> and <span class="math inline">\(\Phi_y\)</span> account for the last factor of <span class="math inline">\(2^2\)</span>.</p>
|
||||
<p>In a trivalent lattice, there are three bonds for every 2 vertices. Substituting <span class="math inline">\(3V = 2B\)</span> into Euler’s equation tells us that any trivalent lattice on the torus with <span class="math inline">\(N\)</span> plaquettes has <span class="math inline">\(2N\)</span> vertices and <span class="math inline">\(3N\)</span> bonds. Since each bond is part of two plaquettes this implies that the mean number of sides of a plaquette is exactly six and that odd sides plaquettes must come in pairs.</p>
|
||||
<p>In a trivalent lattice, there are three bonds for every 2 vertices. Substituting <span class="math inline">\(3V = 2B\)</span> into Euler’s equation tells us that any trivalent lattice on the torus with <span class="math inline">\(N\)</span> plaquettes has <span class="math inline">\(2N\)</span> vertices and <span class="math inline">\(3N\)</span> bonds. Since each bond is part of two plaquettes this implies that the mean number of sides of a plaquette is exactly six and that odd sided plaquettes must come in pairs.</p>
|
||||
<p>Next Section: <a href="../4_Amorphous_Kitaev_Model/4.2_AMK_Methods.html">Methods</a></p>
|
||||
</section>
|
||||
</section>
|
||||
|
@ -92,10 +92,10 @@ image:
|
||||
<figcaption aria-hidden="true">Figure 2: Different valid three-edge-colourings of an amorphous lattice. Colors that differ from the leftmost panel are highlighted in the other panels.</figcaption>
|
||||
</figure>
|
||||
<p>To be solvable, the AK model requires that each edge in the lattice be assigned a label <span class="math inline">\(x\)</span>, <span class="math inline">\(y\)</span> or <span class="math inline">\(z\)</span>, such that each vertex has exactly one edge of each type connected to it, a three-edge-colouring. This problem must be distinguished from that considered by the famous four-colour theorem <span class="citation" data-cites="appelEveryPlanarMap1989"> [<a href="#ref-appelEveryPlanarMap1989" role="doc-biblioref">9</a>]</span>. The four-colour theorem is concerned with assigning colours to the <strong>vertices</strong> of planar graphs, such that no vertices that share an edge have the same colour.</p>
|
||||
<p>For a graph of maximum degree <span class="math inline">\(\Delta\)</span>, <span class="math inline">\(\Delta + 1\)</span> colours are always enough to edge-colour it. An <span class="math inline">\(\mathcal{O}(mn)\)</span> algorithm exists to do this for a graph with <span class="math inline">\(m\)</span> edges and <span class="math inline">\(n\)</span> vertices <span class="citation" data-cites="gEstimateChromaticClass1964"> [<a href="#ref-gEstimateChromaticClass1964" role="doc-biblioref">10</a>]</span>. Graphs with <span class="math inline">\(\Delta = 3\)</span> are known as cubic graphs. Cubic graphs can be four-edge-coloured in linear time <span class="citation" data-cites="skulrattanakulchai4edgecoloringGraphsMaximum2002"> [<a href="#ref-skulrattanakulchai4edgecoloringGraphsMaximum2002" role="doc-biblioref">11</a>]</span>. However we need a three-edge-colouring of our cubic graphs, which turns out to be more difficult. Cubic, planar, <em>bridgeless</em> graphs can be three-edge-coloured if and only if they can be four-face-coloured <span class="citation" data-cites="tait1880remarks"> [<a href="#ref-tait1880remarks" role="doc-biblioref">12</a>]</span>. Bridges are edges that connect otherwise disconnected components. An <span class="math inline">\(\mathcal{O}(n^2)\)</span> algorithm exists for these <span class="citation" data-cites="robertson1996efficiently"> [<a href="#ref-robertson1996efficiently" role="doc-biblioref">13</a>]</span>. However, it is not clear whether this extends to cubic, <strong>toroidal</strong> bridgeless graphs.</p>
|
||||
<p>A four-face-colouring is equivalent to a four-vertex-colouring of the dual graph, see <a href="../6_Appendices/A.3_Lattice_Generation.html#app-lattice-generation">appendix A.4</a>. So if we could find a four-vertex-colouring of the dual graph we would be done. However vertex-colouring a toroidal graph may require up to seven colours <span class="citation" data-cites="heawoodMapColouringTheorems"> [<a href="#ref-heawoodMapColouringTheorems" role="doc-biblioref">14</a>]</span>! The complete graph of seven vertices <span class="math inline">\(K_7\)</span> is a good example of a toroidal graph that requires seven colours.</p>
|
||||
<p>For a graph of maximum degree <span class="math inline">\(\Delta\)</span>, <span class="math inline">\(\Delta + 1\)</span> colours are always enough to edge-colour it. An <span class="math inline">\(\mathcal{O}(mn)\)</span> algorithm exists to do this for a graph with <span class="math inline">\(m\)</span> edges and <span class="math inline">\(n\)</span> vertices <span class="citation" data-cites="gEstimateChromaticClass1964"> [<a href="#ref-gEstimateChromaticClass1964" role="doc-biblioref">10</a>]</span>. Graphs with <span class="math inline">\(\Delta = 3\)</span> are known as cubic graphs. Cubic graphs can be four-edge-coloured in linear time <span class="citation" data-cites="skulrattanakulchai4edgecoloringGraphsMaximum2002"> [<a href="#ref-skulrattanakulchai4edgecoloringGraphsMaximum2002" role="doc-biblioref">11</a>]</span>. However, we need a three-edge-colouring of our cubic graphs, which turns out to be more difficult. Cubic, planar, <em>bridgeless</em> graphs can be three-edge-coloured if and only if they can be four-face-coloured <span class="citation" data-cites="tait1880remarks"> [<a href="#ref-tait1880remarks" role="doc-biblioref">12</a>]</span>. Bridges are edges that connect otherwise disconnected components. An <span class="math inline">\(\mathcal{O}(n^2)\)</span> algorithm exists for these <span class="citation" data-cites="robertson1996efficiently"> [<a href="#ref-robertson1996efficiently" role="doc-biblioref">13</a>]</span>. However, it is not clear whether this extends to cubic, <strong>toroidal</strong> bridgeless graphs.</p>
|
||||
<p>A four-face-colouring is equivalent to a four-vertex-colouring of the dual graph, see <a href="../6_Appendices/A.3_Lattice_Generation.html#app-lattice-generation">appendix A.4</a>. So if we could find a four-vertex-colouring of the dual graph we would be done. However, vertex-colouring a toroidal graph may require up to seven colours <span class="citation" data-cites="heawoodMapColouringTheorems"> [<a href="#ref-heawoodMapColouringTheorems" role="doc-biblioref">14</a>]</span>! The complete graph of seven vertices <span class="math inline">\(K_7\)</span> is a good example of a toroidal graph that requires seven colours.</p>
|
||||
<p>Luckily, some problems are easier in practice. Three-edge-colouring cubic toroidal graphs is one of those things. To find colourings, we use a Boolean Satisfiability Solver or SAT solver. A SAT problem is a set of statements about a set of boolean variables <span class="math inline">\([x_1, x_2\ldots]\)</span>, such as “<span class="math inline">\(x_1\)</span> or not <span class="math inline">\(x_3\)</span> is true”. A solution to a SAT problem is an assignment <span class="math inline">\(x_i \in {0,1}\)</span> that satisfies all the statements <span class="citation" data-cites="Karp1972"> [<a href="#ref-Karp1972" role="doc-biblioref">15</a>]</span>. General purpose, high performance programs for solving SAT problems have been an area of active research for decades <span class="citation" data-cites="alounehComprehensiveStudyAnalysis2019"> [<a href="#ref-alounehComprehensiveStudyAnalysis2019" role="doc-biblioref">16</a>]</span>. Such programs are useful because, by the Cook-Levin theorem <span class="citation" data-cites="cookComplexityTheoremprovingProcedures1971 levin1973universal"> [<a href="#ref-cookComplexityTheoremprovingProcedures1971" role="doc-biblioref">17</a>,<a href="#ref-levin1973universal" role="doc-biblioref">18</a>]</span>, any NP problem can be encoded (in polynomial time) as an instance of a SAT problem. This property is what makes SAT one of the subset of NP problems called NP-Complete. It is a relatively standard technique in the computer science community to solve NP problems by first transforming them to SAT instances and then using an off-the-shelf SAT solver. The output of this can then be mapped back to the original problem domain.</p>
|
||||
<p>Whether graph colouring problems are in NP or P seems to depend delicately on the class of graphs considered, the maximum degree and the number of colours used. It is therefore possible that a polynomial time algorithm may exist for our problem. However using a SAT solver turns out to be fast enough in practice that it is by no means the rate limiting step for generating and solving instances of the AK model. In <a href="../6_Appendices/A.3_Lattice_Generation.html#app-lattice-generation">appendix A.4</a> I detail the specifics of how I mapped edge-colouring problems to SAT instances and show a breakdown of where the computational effort is spent, the majority being on matrix diagonalisation.</p>
|
||||
<p>Whether graph colouring problems are in NP or P seems to depend delicately on the class of graphs considered, the maximum degree and the number of colours used. It is therefore possible that a polynomial time algorithm may exist for our problem. However, using a SAT solver turns out to be fast enough in practice that it is by no means the rate limiting step for generating and solving instances of the AK model. In <a href="../6_Appendices/A.3_Lattice_Generation.html#app-lattice-generation">appendix A.4</a> I detail the specifics of how I mapped edge-colouring problems to SAT instances and show a breakdown of where the computational effort is spent, the majority being on matrix diagonalisation.</p>
|
||||
</section>
|
||||
<section id="mapping-between-flux-sectors-and-bond-sectors" class="level2">
|
||||
<h2>Mapping between flux sectors and bond sectors</h2>
|
||||
|
@ -94,28 +94,28 @@ image:
|
||||
</div>
|
||||
<section id="amk-results" class="level1">
|
||||
<h1>Results</h1>
|
||||
<p>This section contains our results on the AK model, we first look at how we checked numerically that Lieb’s theorem generalises to our model. Next we compute the ground state diagram and look at the two phases that arise there. We then use a local Chern marker and the presence of edge modes to characterise these phases as having Abelian or non-Abelian statistics. Finally we look at the finite temperature behaviour of the model.</p>
|
||||
<p>This section contains our results on the AK model, we first look at how we checked numerically that Lieb’s theorem generalises to our model. Next we compute the ground state diagram and look at the two phases that arise there. We then use a local Chern marker and the presence of edge modes to characterise these phases as having Abelian or non-Abelian statistics. Finally, we look at the finite temperature behaviour of the model.</p>
|
||||
<section id="the-ground-state-flux-sector" class="level2">
|
||||
<h2>The Ground State Flux Sector</h2>
|
||||
<p>We will check that Lieb’s theorem generalises to our model by enumerating all the flux sectors of many separate amorphous lattice realisations. However, we have two seemingly irreconcilable problems. Finite size effects have a large energetic contribution for small systems <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span> so we would like to perform our analysis for very large lattices. For an amorphous system with <span class="math inline">\(N\)</span> plaquettes, <span class="math inline">\(2N\)</span> edges and <span class="math inline">\(3N\)</span> vertices we have <span class="math inline">\(2^{N-1}\)</span> flux sectors to check and diagonalisation scales with <span class="math inline">\(\mathcal{O}(N^3)\)</span>. That exponential scaling makes it difficult to work with lattices much larger than <span class="math inline">\(16\)</span> plaquettes with the resources.</p>
|
||||
<p>To get around this, we instead look at periodic systems with amorphous unit cells. For a similarly sized periodic system with <span class="math inline">\(A\)</span> unit cells and <span class="math inline">\(B\)</span> plaquettes in each unit cell where <span class="math inline">\(N \sim AB\)</span> things get much better. We can use Bloch’s theorem to diagonalise this system in about <span class="math inline">\(\mathcal{O}(A B^3)\)</span> operations, and more importantly there are only <span class="math inline">\(2^{B-1}\)</span> flux sectors to check. We fully enumerated the flux sectors of <span class="math inline">\(\sim\)</span> 25,000 periodic systems with disordered unit cells of up to <span class="math inline">\(B = 16\)</span> plaquettes and <span class="math inline">\(A = 100\)</span> unit cells. However, showing that our guess is correct for periodic systems with disordered unit cells is not quite convincing on its own as we have effectively removed longer-range disorder from our lattices.</p>
|
||||
<p>The second part of the argument is to show that the energetic effect of introducing periodicity scales away as we go to larger system sizes and has already diminished to a small enough value at 16 plaquettes, which is indeed what we find. From this, we argue that the results for small periodic systems generalise to large amorphous systems. In the isotropic case (<span class="math inline">\(J^\alpha = 1\)</span>), Lieb’s theorem correctly predicts the ground state flux sector for all of the lattices we tested. For the toric code phase (<span class="math inline">\(J^x = J^y = 0.25, J^z = 1\)</span>) all but around (<span class="math inline">\(\sim 0.5 \%\)</span>) lattices had ground states conforming to our conjecture. In these cases, the energy difference between the true ground state and our prediction was on the order of <span class="math inline">\(10^{-6} J\)</span>.</p>
|
||||
<p>The spin Kitaev Hamiltonian is real and therefore has time reversal symmetry. However in the ground state the flux <span class="math inline">\(\phi_p\)</span> through any plaquette with an odd number of sides has imaginary eigenvalues <span class="math inline">\(\pm i\)</span>. Thus, states with a fixed flux sector spontaneously break time reversal symmetry. Kiteav noted this in his original paper but it was first explored in a concrete model by Yao and Kivelson for a translation invariant Kitaev model with odd sided plaquettes <span class="citation" data-cites="Yao2011"> [<a href="#ref-Yao2011" role="doc-biblioref">2</a>]</span>.</p>
|
||||
<p>The second part of the argument is to show that the energetic effect of introducing periodicity scales away as we go to larger system sizes and has already diminished to a small enough value at 16 plaquettes, which is indeed what we find. From this, we argue that the results for small periodic systems generalise to large amorphous systems. In the isotropic case (<span class="math inline">\(J^\alpha = 1\)</span>), Lieb’s theorem correctly predicts the ground state flux sector for all of the lattices we tested. For the toric code phase (<span class="math inline">\(J^x = J^y = 0.25, J^z = 1\)</span>) all but around <span class="math inline">\(\sim 0.5 \%\)</span> of lattices had ground states conforming to our conjecture. In these cases, the energy difference between the true ground state and our prediction was on the order of <span class="math inline">\(10^{-6}]\;J\)</span>.</p>
|
||||
<p>The spin Kitaev Hamiltonian is real and therefore has time reversal symmetry. However, in the ground state the flux <span class="math inline">\(\phi_p\)</span> through any plaquette with an odd number of sides has imaginary eigenvalues <span class="math inline">\(\pm i\)</span>. Thus, states with a fixed flux sector spontaneously break time reversal symmetry. Kiteav noted this in his original paper but it was first explored in a concrete model by Yao and Kivelson for a translation invariant Kitaev model with odd sided plaquettes <span class="citation" data-cites="Yao2011"> [<a href="#ref-Yao2011" role="doc-biblioref">2</a>]</span>.</p>
|
||||
<p>Flux sectors come in degenerate pairs, where time reversal is equivalent to inverting the flux through every odd plaquette, a general feature for lattices with odd plaquettes <span class="citation" data-cites="yaoExactChiralSpin2007 Peri2020"> [<a href="#ref-yaoExactChiralSpin2007" role="doc-biblioref">3</a>,<a href="#ref-Peri2020" role="doc-biblioref">4</a>]</span>. This spontaneously broken symmetry serves a role analogous to the external magnetic field in the original honeycomb model, leading the AK model to have a non-Abelian anyonic phase without an external magnetic field.</p>
|
||||
</section>
|
||||
<section id="ground-state-phase-diagram" class="level2">
|
||||
<h2>Ground State Phase Diagram</h2>
|
||||
<p>The triangular <span class="math inline">\(J_x, J_y, J_z\)</span> phase diagram of this family of models arises from setting the energy scale with <span class="math inline">\(J_x + J_y + J_z = 1\)</span>. The intersection of this plane and the unit cube is what yields the equilateral triangles seen in diagrams like fig. <a href="#fig:phase_diagram">1</a>. The KH model has an Abelian gapped phase in the anisotropic region (the A phase) and is gapless in the isotropic region. The introduction of a magnetic field breaks the chiral symmetry, leading to the isotropic region becoming a gapped, non-Abelian phase, the B phase.</p>
|
||||
<p>Similar to the KH model with a magnetic field, we find that the amorphous model is only gapless along critical lines, see fig. <a href="#fig:phase_diagram">1</a> (Left). Interestingly, in finite size systems the gap closing exists in only one of the four topological sectors though the sectors become degenerate in the thermodynamic limit. Nevertheless, this could be a useful way to define the (0, 0) topological flux sector for the amorphous model which otherwise has no natural way to choose it.</p>
|
||||
<p>In the honeycomb model, the phase boundaries are located on the straight lines <span class="math inline">\(|J^x| = |J^y| \;+ \;|J^z|\)</span> and permutations of <span class="math inline">\(x,y,z\)</span>. These are shown as dotted lines in fig. <a href="#fig:phase_diagram">1</a> (Right). We find that on the amorphous lattice these boundaries exhibit an inward curvature, similar to honeycomb Kitaev models with flux or bond disorder <span class="citation" data-cites="knolle_dynamics_2016 Nasu_Thermal_2015 lahtinenPerturbedVortexLattices2014 willansDisorderQuantumSpin2010 zschockePhysicalStatesFinitesize2015 kaoDisorderDisorderLocalization2021"> [<a href="#ref-knolle_dynamics_2016" role="doc-biblioref">5</a>–<a href="#ref-kaoDisorderDisorderLocalization2021" role="doc-biblioref">10</a>]</span>.</p>
|
||||
<p>Similar to the KH model with a magnetic field, we find that the amorphous model is only gapless along critical lines, see the left panel of fig. <a href="#fig:phase_diagram">1</a> (left panel). Interestingly, in finite size systems the gap closing exists in only one of the four topological sectors though the sectors become degenerate in the thermodynamic limit. Nevertheless, this could be a useful way to define the (0, 0) topological flux sector for the amorphous model which otherwise has no natural way to choose it.</p>
|
||||
<p>In the honeycomb model, the phase boundaries are located on the straight lines <span class="math inline">\(|J^x| = |J^y| \;+ \;|J^z|\)</span> and permutations of <span class="math inline">\(x,y,z\)</span>. These are shown as dotted lines in fig. <a href="#fig:phase_diagram">1</a> (right panel). We find that on the amorphous lattice these boundaries exhibit an inward curvature, similar to honeycomb Kitaev models with flux or bond disorder <span class="citation" data-cites="knolle_dynamics_2016 Nasu_Thermal_2015 lahtinenPerturbedVortexLattices2014 willansDisorderQuantumSpin2010 zschockePhysicalStatesFinitesize2015 kaoDisorderDisorderLocalization2021"> [<a href="#ref-knolle_dynamics_2016" role="doc-biblioref">5</a>–<a href="#ref-kaoDisorderDisorderLocalization2021" role="doc-biblioref">10</a>]</span>.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/amk_chapter/results/phase_diagram/phase_diagram.svg" id="fig-phase_diagram" data-short-caption="The Ground State Phase Diagram" style="width:100.0%" alt="Figure 1: The phase diagram of the model can be characterised by an equilateral triangle whose corners indicate points where J_\alpha = 1, J_\beta = J_\gamma = 0 while the centre denotes J_x = J_y = J_z. (Center) To compute critical lines efficiently in this space, we evaluate the order parameter of interest along rays shooting from the corners of the phase diagram. The ray highlighted in red defines the values of J used for the left figure. (Left) The fermion gap as a function of J for an amorphous system with 20 plaquettes, where the x axis is the position on the red line in the central figure from 0 to 1. For finite size systems the four topological sectors are not degenerate and only one of them (in green) has a true gap closing. (Right) The Abelian A_\alpha phases of the model and the non-Abelian B phase separated by critical lines where the fermion gap closes. Later we will show that the Chern number \nu changes from 0 to \pm 1 from the A phases to the B phase. Indeed, the gap must close in order for the Chern number to change [11]." />
|
||||
<figcaption aria-hidden="true">Figure 1: The phase diagram of the model can be characterised by an equilateral triangle whose corners indicate points where <span class="math inline">\(J_\alpha = 1, J_\beta = J_\gamma = 0\)</span> while the centre denotes <span class="math inline">\(J_x = J_y = J_z\)</span>. (Center) To compute critical lines efficiently in this space, we evaluate the order parameter of interest along rays shooting from the corners of the phase diagram. The ray highlighted in red defines the values of J used for the left figure. (Left) The fermion gap as a function of J for an amorphous system with 20 plaquettes, where the x axis is the position on the red line in the central figure from 0 to 1. For finite size systems the four topological sectors are not degenerate and only one of them (in green) has a true gap closing. (Right) The Abelian <span class="math inline">\(A_\alpha\)</span> phases of the model and the non-Abelian B phase separated by critical lines where the fermion gap closes. Later we will show that the Chern number <span class="math inline">\(\nu\)</span> changes from <span class="math inline">\(0\)</span> to <span class="math inline">\(\pm 1\)</span> from the A phases to the B phase. Indeed, the gap <em>must</em> close in order for the Chern number to change <span class="citation" data-cites="ezawaTopologicalPhaseTransition2013"> [<a href="#ref-ezawaTopologicalPhaseTransition2013" role="doc-biblioref">11</a>]</span>.</figcaption>
|
||||
<figcaption aria-hidden="true">Figure 1: The phase diagram of the model can be characterised by an equilateral triangle whose corners indicate points where <span class="math inline">\(J_\alpha = 1, J_\beta = J_\gamma = 0\)</span> while the centre denotes <span class="math inline">\(J_x = J_y = J_z\)</span>. (Center) To compute critical lines efficiently in this space, we evaluate the order parameter of interest along rays shooting from the corners of the phase diagram. The ray highlighted in red defines the values of <span class="math inline">\(J\)</span> used for the left figure. (Left) The fermion gap as a function of <span class="math inline">\(J\)</span> for an amorphous system with 20 plaquettes, where the <span class="math inline">\(x\)</span> axis is the position on the red line in the central figure from 0 to 1. For finite size systems the four topological sectors are not degenerate and only one of them (in green) has a true gap closing. (Right) The Abelian <span class="math inline">\(A_\alpha\)</span> phases of the model and the non-Abelian B phase separated by critical lines where the fermion gap closes. Later we will show that the Chern number <span class="math inline">\(\nu\)</span> changes from <span class="math inline">\(0\)</span> to <span class="math inline">\(\pm 1\)</span> from the A phases to the B phase. Indeed, the gap <em>must</em> close in order for the Chern number to change <span class="citation" data-cites="ezawaTopologicalPhaseTransition2013"> [<a href="#ref-ezawaTopologicalPhaseTransition2013" role="doc-biblioref">11</a>]</span>.</figcaption>
|
||||
</figure>
|
||||
<section id="abelian-or-non-abelian-statistics-of-the-gapped-phase" class="level3">
|
||||
<h3>Abelian or non-Abelian statistics of the Gapped Phase</h3>
|
||||
<p>The two phases of the amorphous model are gapped as we can see from the finite size scaling of fig. <a href="#fig:fermion_gap_vs_L">4</a>. The next question is: do these phases support excitations with trivial, Abelian or non-Abelian statistics? To answer that we turn to Chern numbers <span class="citation" data-cites="berryQuantalPhaseFactors1984 simonHolonomyQuantumAdiabatic1983 thoulessQuantizedHallConductance1982"> [<a href="#ref-berryQuantalPhaseFactors1984" role="doc-biblioref">12</a>–<a href="#ref-thoulessQuantizedHallConductance1982" role="doc-biblioref">14</a>]</span>. As discussed earlier the Chern number is a quantity intimately linked to both the topological properties and the anyonic statistics of a model. Here we will make use of the fact that the Abelian/non-Abelian character of a model is linked to its Chern number <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>. The Chern number is only defined for the translation invariant case so we instead use a family of real space generalisations of the Chern number that work for amorphous systems called local topological markers <span class="citation" data-cites="bianco_mapping_2011 Hastings_Almost_2010 mitchellAmorphousTopologicalInsulators2018"> [<a href="#ref-bianco_mapping_2011" role="doc-biblioref">15</a>–<a href="#ref-mitchellAmorphousTopologicalInsulators2018" role="doc-biblioref">17</a>]</span>.</p>
|
||||
<p>There are many possible choices here, indeed Kitaev defines one in his original paper on the KH model <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>. Here we use the crosshair marker of <span class="citation" data-cites="peru_preprint"> [<a href="#ref-peru_preprint" role="doc-biblioref">18</a>]</span> because it works well on smaller systems. We calculate the projector <span class="math inline">\(P = \sum_i |\psi_i\rangle \langle \psi_i|\)</span> onto the occupied fermion eigenstates of the system in open boundary conditions. The projector encodes local information about the occupied eigenstates of the system and in gapped systems it is exponentially localised <span class="citation" data-cites="hastingsLiebSchultzMattisHigherDimensions2004"> [<a href="#ref-hastingsLiebSchultzMattisHigherDimensions2004" role="doc-biblioref">19</a>]</span>. The name <em>crosshair</em> comes from the fact that the marker is defined with respect to a particular point <span class="math inline">\((x_0, y_0)\)</span> by step functions in x and y</p>
|
||||
<p>There are many possible choices here, indeed Kitaev defines one in his original paper on the KH model <span class="citation" data-cites="kitaevAnyonsExactlySolved2006"> [<a href="#ref-kitaevAnyonsExactlySolved2006" role="doc-biblioref">1</a>]</span>. Here we use the crosshair marker of <span class="citation" data-cites="peru_preprint"> [<a href="#ref-peru_preprint" role="doc-biblioref">18</a>]</span> because it works well on smaller systems. We calculate the projector <span class="math inline">\(P = \sum_i |\psi_i\rangle \langle \psi_i|\)</span> onto the occupied fermion eigenstates of the system in open boundary conditions. The projector encodes local information about the occupied eigenstates of the system and in gapped systems it is exponentially localised <span class="citation" data-cites="hastingsLiebSchultzMattisHigherDimensions2004"> [<a href="#ref-hastingsLiebSchultzMattisHigherDimensions2004" role="doc-biblioref">19</a>]</span>. The name <em>crosshair</em> comes from the fact that the marker is defined with respect to a particular point <span class="math inline">\((x_0, y_0)\)</span> by step functions in <span class="math inline">\(x\)</span> and <span class="math inline">\(y\)</span></p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\nu (x, y) = 4\pi \; \Im\; \mathrm{Tr}_{\mathrm{B}}
|
||||
\left (
|
||||
@ -161,7 +161,7 @@ image:
|
||||
</section>
|
||||
<section id="sec:AMK-Conclusion" class="level1">
|
||||
<h1>Discussion and Conclusion</h1>
|
||||
<p>In this chapter we have looked at an extension of the KH model to amorphous lattices with coordination number three. We discussed a method to construct arbitrary trivalent lattices using Voronoi partitions, how to embed them onto the torus and how to edge-colour them using a SAT solver. We showed numerically that the ground state flux sector of the model is given by a simple extension of Lieb’s theorem. The model has two gapped QSL phases. The two phases support excitations with different anyonic statistics, Abelian and non-Abelian, distinguished using a real-space generalisation of the Chern number <span class="citation" data-cites="peru_preprint"> [<a href="#ref-peru_preprint" role="doc-biblioref">18</a>]</span>. The presence of odd-sided plaquettes in the model resulted in spontaneous breaking of time reversal symmetry, leading to the emergence of a chiral spin liquid phase. Finally we showed evidence that the amorphous system undergoes an Anderson transition to a thermal metal phase, driven by the proliferation of vortices with increasing temperature. The AK model is an exactly solvable model of the chiral QSL state, one of the first models to exhibit a topologically non-trivial quantum many-body phase on an amorphous lattice. As such this study provides a number of future lines of research.</p>
|
||||
<p>In this chapter we have looked at an extension of the KH model to amorphous lattices with coordination number three. We discussed a method to construct arbitrary trivalent lattices using Voronoi partitions, how to embed them onto the torus and how to edge-colour them using a SAT solver. We showed numerically that the ground state flux sector of the model is given by a simple extension of Lieb’s theorem. The model has two gapped QSL phases. The two phases support excitations with different anyonic statistics, Abelian and non-Abelian, distinguished using a real-space generalisation of the Chern number <span class="citation" data-cites="peru_preprint"> [<a href="#ref-peru_preprint" role="doc-biblioref">18</a>]</span>. The presence of odd-sided plaquettes in the model resulted in spontaneous breaking of time reversal symmetry, leading to the emergence of a chiral spin liquid phase. Finally, we showed evidence that the amorphous system undergoes an Anderson transition to a thermal metal phase, driven by the proliferation of vortices with increasing temperature. The AK model is an exactly solvable model of the chiral QSL state, one of the first models to exhibit a topologically non-trivial quantum many-body phase on an amorphous lattice. As such this study provides a number of future lines of research.</p>
|
||||
<section id="experimental-realisations-and-signatures" class="level3">
|
||||
<h3>Experimental Realisations and Signatures</h3>
|
||||
<p>We should consider whether a physical amorphous system that supports a QSL ground state could exist. The search for translation invariant Kitaev systems is already motivated by the prospect of a physically realised QSL state, Majorana fermions and direct access to a system with emergent <span class="math inline">\(\mathbb{Z}_2\)</span> gauge physics <span class="citation" data-cites="TrebstPhysRep2022"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">29</a>]</span>. In addition to all this, an amorphous Kitaev model would provide the possibility of exploring the CSL state and potentially very different routes to a physical realisation. One route would be to ask if any crystalline Kitaev material candidates can be heated and rapidly quenched <span class="citation" data-cites="Weaire1976 Petrakovski1981 Kaneyoshi2018"> [<a href="#ref-Weaire1976" role="doc-biblioref">30</a>–<a href="#ref-Kaneyoshi2018" role="doc-biblioref">32</a>]</span> to produce amorphous analogues that might preserve enough of their local structure to support a QSL state.</p>
|
||||
@ -175,7 +175,7 @@ image:
|
||||
<p>The KH model can be extended to 3D either on trivalent lattices <span class="citation" data-cites="eschmannThermodynamicClassificationThreedimensional2020 OBrienPRB2016"> [<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">54</a>,<a href="#ref-OBrienPRB2016" role="doc-biblioref">55</a>]</span> or it can be generalised to an exactly solvable spin-<span class="math inline">\(\tfrac{3}{2}\)</span> model on 3D four-coordinate lattices <span class="citation" data-cites="yaoAlgebraicSpinLiquid2009 wenQuantumOrderStringnet2003 ryuThreedimensionalTopologicalPhase2009 Baskaran2008 Nussinov2009 Yao2011 Chua2011 Natori2020 Chulliparambil2020 Chulliparambil2021 Seifert2020 WangHaoranPRB2021 Wu2009"> [<a href="#ref-Yao2011" role="doc-biblioref">2</a>,<a href="#ref-yaoAlgebraicSpinLiquid2009" role="doc-biblioref">57</a>–<a href="#ref-Wu2009" role="doc-biblioref">68</a>]</span>. In <span class="citation" data-cites="yaoAlgebraicSpinLiquid2009"> [<a href="#ref-yaoAlgebraicSpinLiquid2009" role="doc-biblioref">57</a>]</span>, the 2D square lattice with 4 bond types (<span class="math inline">\(J_w, J_x, J_y, J_z\)</span>) is considered. Since Voronoi partitions in 3D produce lattices of degree four, one interesting generalisation of this work would be to look at the spin-<span class="math inline">\(\tfrac{3}{2}\)</span> Kitaev model on amorphous lattices.</p>
|
||||
<p>We did not perform a full Markov Chain Monte Carlo (MCMC) simulation of the AK model at finite temperature but the possible extension to a 3D model with an FTPT would motivate this full analysis. This MCMC simulation would be a numerically challenging task but poses no conceptual barriers <span class="citation" data-cites="Laumann2012 lahtinenTopologicalLiquidNucleation2012 selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">23</a>,<a href="#ref-Laumann2012" role="doc-biblioref">25</a>,<a href="#ref-lahtinenTopologicalLiquidNucleation2012" role="doc-biblioref">27</a>]</span>. Doing this would, first, allow one to look for possible violations of the Harris criterion <span class="citation" data-cites="harrisEffectRandomDefects1974"> [<a href="#ref-harrisEffectRandomDefects1974" role="doc-biblioref">69</a>]</span> for the Ising transition of the flux sector. Recall that topological disorder in 2D has radically different properties to that of other kinds of disorder due to the constraints imposed by the Euler equation and maintaining coordination number which allows it to violate otherwise quite general rules like the Harris criterion <span class="citation" data-cites="barghathiPhaseTransitionsRandom2014 schrauthViolationHarrisBarghathiVojtaCriterion2018"> [<a href="#ref-barghathiPhaseTransitionsRandom2014" role="doc-biblioref">70</a>,<a href="#ref-schrauthViolationHarrisBarghathiVojtaCriterion2018" role="doc-biblioref">71</a>]</span>. Second, incorporating the projector in addition to MCMC would allow for a full investigation of whether the effect of topological degeneracy is apparent at finite temperatures, this is done for the KH model in <span class="citation" data-cites="selfThermallyInducedMetallic2019"> [<a href="#ref-selfThermallyInducedMetallic2019" role="doc-biblioref">23</a>]</span>.</p>
|
||||
<p>Next, one could investigate whether a QSL phase may exist for other models defined on amorphous lattices with a view to more realistic prospects of observation. Do the properties of the Kitaev-Heisenberg model generalise from the honeycomb to the amorphous case? <span class="citation" data-cites="Chaloupka2010 Chaloupka2015 Jackeli2009 Kalmeyer1989 manousakis1991"> [<a href="#ref-Chaloupka2010" role="doc-biblioref">41</a>,<a href="#ref-Chaloupka2015" role="doc-biblioref">43</a>,<a href="#ref-Jackeli2009" role="doc-biblioref">72</a>–<a href="#ref-manousakis1991" role="doc-biblioref">74</a>]</span> Alternatively we might look at other lattice construction techniques. For instance we could construct lattices by linking close points <span class="citation" data-cites="agarwala2019topological"> [<a href="#ref-agarwala2019topological" role="doc-biblioref">75</a>]</span> or create simplices from random sites <span class="citation" data-cites="christRandomLatticeField1982"> [<a href="#ref-christRandomLatticeField1982" role="doc-biblioref">76</a>]</span>. Lattices constructed using these methods would likely have a large number of lattice defects where <span class="math inline">\(z \neq 3\)</span> in the bulk, leading to many localised Majorana zero modes.</p>
|
||||
<p>We found a small number of lattices for which Lieb’s theorem did not correctly predict the true ground state flux sector. I see two possibilities for what could cause this. Firstly it could be a finite size effect that is amplified by certain rare lattice configurations. It would be interesting to try to elucidate what lattice features are present when Lieb’s theorem fails. Alternatively, it might be telling that the ground state conjecture failed in the toric code A phase where the couplings are anisotropic. We showed that the colouring does not matter in the B phase. However an avenue that I did not explore was whether the particular choice of colouring for a lattice affects the physical properties in the toric code A phase. It is possible that some property of the particular colouring chosen is what leads to these rare failures of Lieb’s theorem.</p>
|
||||
<p>We found a small number of lattices for which Lieb’s theorem did not correctly predict the true ground state flux sector. I see two possibilities for what could cause this. Firstly, it could be a finite size effect that is amplified by certain rare lattice configurations. It would be interesting to try to elucidate what lattice features are present when Lieb’s theorem fails. Alternatively, it might be telling that the ground state conjecture failed in the toric code A phase where the couplings are anisotropic. We showed that the colouring does not matter in the B phase. However, an avenue that I did not explore was whether the particular choice of colouring for a lattice affects the physical properties in the toric code A phase. It is possible that some property of the particular colouring chosen is what leads to these rare failures of Lieb’s theorem.</p>
|
||||
<p>Overall, there has been surprisingly little research on amorphous quantum many-body phases despite there being plenty of material candidates. I expect the exact chiral amorphous spin liquid to find many generalisations to realistic amorphous quantum magnets.</p>
|
||||
<p>Next Chapter: <a href="../5_Conclusion/5_Conclusion.html">5 Conclusion</a></p>
|
||||
</section>
|
||||
|
@ -70,7 +70,7 @@ image:
|
||||
<img src="/assets/thesis/conclusion/mof.svg" id="fig-mof" data-short-caption="Example of a Metal Organic Framework" style="width:100.0%" alt="Figure 1: An example of a Metal Organic Framework (MOF). MOFs are large synthetic organic ligands coordinated with metal ions that can cross link to form crystals. It has been demonstrated that these materials can sustain Kitaev-like interactions [5]. They can also be made into amorphous materials [6]. This image from [7] is in the public domain." />
|
||||
<figcaption aria-hidden="true">Figure 1: An example of a Metal Organic Framework (MOF). MOFs are large synthetic organic ligands coordinated with metal ions that can cross link to form crystals. It has been demonstrated that these materials can sustain Kitaev-like interactions <span class="citation" data-cites="yamadaDesigningKitaevSpin2017"> [<a href="#ref-yamadaDesigningKitaevSpin2017" role="doc-biblioref">5</a>]</span>. They can also be made into amorphous materials <span class="citation" data-cites="bennett2014amorphous"> [<a href="#ref-bennett2014amorphous" role="doc-biblioref">6</a>]</span>. This image from <span class="citation" data-cites="rosiHydrogenStorageMicroporous2003"> [<a href="#ref-rosiHydrogenStorageMicroporous2003" role="doc-biblioref">7</a>]</span> is in the public domain.</figcaption>
|
||||
</figure>
|
||||
<p>Unlike the 2D FK model and 1D LRFK models, the KH and AK models don’t have a Finite-Temperature Phase Transition (FTPT). They immediately disorder at any finite temperature <span class="citation" data-cites="eschmannThermodynamicClassificationThreedimensional2020"> [<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">3</a>]</span>. However, generalisations of the KH model to 3D do in general have an FTPT. Indeed, the role of dimensionality has been a key theme in this work. Both localisation and thermodynamic phenomena depend crucially on dimensionality, with thermodynamic order generally suppressed and localisation effects strengthened in low dimensions. The graph theory that underpins the KH and AK models itself also changes strongly with dimension. Voronisation in 2D produces trivalent lattices, on which the spin-<span class="math inline">\(1/2\)</span> AK model is exactly solvable. Meanwhile in 3D, voronisation gives us <span class="math inline">\(z=4\)</span> lattices upon which a spin-<span class="math inline">\(3/2\)</span> generalisation to the KH model is exactly solvable <span class="citation" data-cites="yaoAlgebraicSpinLiquid2009 wenQuantumOrderStringnet2003 ryuThreedimensionalTopologicalPhase2009"> [<a href="#ref-yaoAlgebraicSpinLiquid2009" role="doc-biblioref">8</a>–<a href="#ref-ryuThreedimensionalTopologicalPhase2009" role="doc-biblioref">10</a>]</span>. Similarly, planar and toroidal graphs are a uniquely 2D construct. Satisfying planarity imposes constraints on the connectivity of planar graphs. This leads amorphous planar graphs to have strong anti-correlations which can violate otherwise robust bounds like the Harris criterion <span class="citation" data-cites="harrisEffectRandomDefects1974"> [<a href="#ref-harrisEffectRandomDefects1974" role="doc-biblioref">11</a>]</span>. Contrast this with Anderson localisation in 1D where only longer range correlations in the disorder can produce surprising effects <span class="citation" data-cites="aubryAnalyticityBreakingAnderson1980 dassarmaLocalizationMobilityEdges1990 dunlapAbsenceLocalizationRandomdimer1990 izrailevLocalizationMobilityEdge1999 croyAndersonLocalization1D2011 izrailevAnomalousLocalizationLowDimensional2012"> [<a href="#ref-aubryAnalyticityBreakingAnderson1980" role="doc-biblioref">12</a>–<a href="#ref-izrailevAnomalousLocalizationLowDimensional2012" role="doc-biblioref">17</a>]</span>.</p>
|
||||
<p>Unlike the 2D FK model and 1D LRFK models, the KH and AK models don’t have a finite-temperature phase transition (FTPT). They immediately disorder at any finite temperature <span class="citation" data-cites="eschmannThermodynamicClassificationThreedimensional2020"> [<a href="#ref-eschmannThermodynamicClassificationThreedimensional2020" role="doc-biblioref">3</a>]</span>. However, generalisations of the KH model to 3D do in general have an FTPT. Indeed, the role of dimensionality has been a key theme in this work. Both localisation and thermodynamic phenomena depend crucially on dimensionality, with thermodynamic order generally suppressed and localisation effects strengthened in low dimensions. The graph theory that underpins the KH and AK models itself also changes strongly with dimension. Voronisation in 2D produces trivalent lattices, on which the spin-<span class="math inline">\(1/2\)</span> AK model is exactly solvable. Meanwhile in 3D, voronisation gives us <span class="math inline">\(z=4\)</span> lattices upon which a spin-<span class="math inline">\(3/2\)</span> generalisation to the KH model is exactly solvable <span class="citation" data-cites="yaoAlgebraicSpinLiquid2009 wenQuantumOrderStringnet2003 ryuThreedimensionalTopologicalPhase2009"> [<a href="#ref-yaoAlgebraicSpinLiquid2009" role="doc-biblioref">8</a>–<a href="#ref-ryuThreedimensionalTopologicalPhase2009" role="doc-biblioref">10</a>]</span>. Similarly, planar and toroidal graphs are a uniquely 2D construct. Satisfying planarity imposes constraints on the connectivity of planar graphs. This leads amorphous planar graphs to have strong anti-correlations which can violate otherwise robust bounds like the Harris criterion <span class="citation" data-cites="harrisEffectRandomDefects1974"> [<a href="#ref-harrisEffectRandomDefects1974" role="doc-biblioref">11</a>]</span>. Contrast this with Anderson localisation in 1D where only longer range correlations in the disorder can produce surprising effects <span class="citation" data-cites="aubryAnalyticityBreakingAnderson1980 dassarmaLocalizationMobilityEdges1990 dunlapAbsenceLocalizationRandomdimer1990 izrailevLocalizationMobilityEdge1999 croyAndersonLocalization1D2011 izrailevAnomalousLocalizationLowDimensional2012"> [<a href="#ref-aubryAnalyticityBreakingAnderson1980" role="doc-biblioref">12</a>–<a href="#ref-izrailevAnomalousLocalizationLowDimensional2012" role="doc-biblioref">17</a>]</span>.</p>
|
||||
<p>Looking towards future work, the LRFK model provides multiple possible routes. One interesting idea is to park the model at a thermal critical point. This would generate a scale-free disorder potential, which could potentially lead to complex localisation physics not often seen in 1D <span class="citation" data-cites="aubryAnalyticityBreakingAnderson1980 shimasakiAnomalousLocalizationMultifractality2022"> [<a href="#ref-aubryAnalyticityBreakingAnderson1980" role="doc-biblioref">12</a>,<a href="#ref-shimasakiAnomalousLocalizationMultifractality2022" role="doc-biblioref">18</a>]</span>. Like other solvable models of disorder-free localisation, the LRFK model should also exhibit intriguing out-of-equilibrium physics, such as slow entanglement dynamics. This could be used to help understand these phenomena in more generic interacting systems <span class="citation" data-cites="hartLogarithmicEntanglementGrowth2021"> [<a href="#ref-hartLogarithmicEntanglementGrowth2021" role="doc-biblioref">19</a>]</span>. There is also the rich ground state phenomenology of the FK model as a function of filling <span class="citation" data-cites="gruberGroundStatesSpinless1990"> [<a href="#ref-gruberGroundStatesSpinless1990" role="doc-biblioref">20</a>]</span>, such as the devil’s staircase <span class="citation" data-cites="michelettiCompleteDevilStaircase1997"> [<a href="#ref-michelettiCompleteDevilStaircase1997" role="doc-biblioref">21</a>]</span> as well as superconductor like states <span class="citation" data-cites="caiVisualizingEvolutionMott2016"> [<a href="#ref-caiVisualizingEvolutionMott2016" role="doc-biblioref">22</a>]</span>. Could the LRFK model stabilise these at finite temperature? Finally, a topological variant of the LRFK model akin to the Su-Schrieffer-Heeger (SSH) model could be an interesting way to probe the interplay of topological bound states and thermal domain wall defects.</p>
|
||||
<p>Looking at the AK model, we discussed whether its physics might be realisable in amorphous versions of known KH candidate materials <span class="citation" data-cites="TrebstPhysRep2022"> [<a href="#ref-TrebstPhysRep2022" role="doc-biblioref">23</a>]</span>. Alternatively, we might be able to engineer them in synthetic materials, such as Metal Organic Frameworks (MOFs). Work on MOFs has already explored the possibility of both Kitaev-like interactions and amorphous lattices <span class="citation" data-cites="yamadaDesigningKitaevSpin2017 bennett2014amorphous"> [<a href="#ref-yamadaDesigningKitaevSpin2017" role="doc-biblioref">5</a>,<a href="#ref-bennett2014amorphous" role="doc-biblioref">6</a>]</span>. It is an open question whether the superexchange couplings that generate Kitaev interactions could survive the transition to an amorphous lattice. If the interactions do survive, there will likely be many defects of different kinds present in the resulting material. These might take the form of dangling bonds, vertex degree <span class="math inline">\(> 3\)</span> or violations of the colouring conditions. I therefore hope that future work examines how robust the QSL and CSL states of the AK model are to these kinds of disorder. This is a difficult task, as many of these classes of defects would break the integrability of the AK model that we relied on to make the work computationally feasible <span class="citation" data-cites="Rau2014 Chaloupka2010 Chaloupka2013 Chaloupka2015 Winter2016"> [<a href="#ref-Rau2014" role="doc-biblioref">24</a>–<a href="#ref-Winter2016" role="doc-biblioref">28</a>]</span>. While we are considering models with defects, we might consider alternate lattice construction techniques <span class="citation" data-cites="agarwala2019topological christRandomLatticeField1982"> [<a href="#ref-agarwala2019topological" role="doc-biblioref">29</a>,<a href="#ref-christRandomLatticeField1982" role="doc-biblioref">30</a>]</span>. Lattices constructed using these methods would have defects but may have other desirable properties when compared to Voronoi lattices.</p>
|
||||
<p>In terms of experimental signatures, we discussed the quantised thermal Hall effect <span class="citation" data-cites="Kasahara2018 Yokoi2021 Yamashita2020 Bruin2022"> [<a href="#ref-Kasahara2018" role="doc-biblioref">31</a>–<a href="#ref-Bruin2022" role="doc-biblioref">34</a>]</span>, local probes such as spin-polarised scanning tunnelling microscopy <span class="citation" data-cites="Feldmeier2020 Konig2020 Udagawa2021"> [<a href="#ref-Feldmeier2020" role="doc-biblioref">35</a>–<a href="#ref-Udagawa2021" role="doc-biblioref">37</a>]</span>, and longitudinal heat transport signatures <span class="citation" data-cites="Beenakker2013"> [<a href="#ref-Beenakker2013" role="doc-biblioref">38</a>]</span>. One possible difficulty is that the introduction of topological disorder may dilute some of these signatures. On the brighter side, topological disorder may also suppress competing interactions that would otherwise induce magnetic ordering. This could potentially widen the class of materials that could host a QSL or CSL ground state.</p>
|
||||
|
@ -70,30 +70,30 @@ image:
|
||||
<h1>Evaluation of the Fermion Free Energy</h1>
|
||||
<p>There are <span class="math inline">\(2^N\)</span> possible configurations of the spins in the LRFK model. In the language of ions and electrons (immobile and mobile species), we define <span class="math inline">\(n^k_i\)</span> to be the occupation of the <span class="math inline">\(i\)</span>th site of the <span class="math inline">\(k\)</span>th configuration. The quantum part of the free energy can then be defined through the quantum partition function <span class="math inline">\(\mathcal{Z}^k\)</span> associated with each state <span class="math inline">\(n^k_i\)</span>:</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
F^k &= -1/\beta \ln{\mathcal{Z}^k} \\
|
||||
F^k &= -1/\beta \ln{\mathcal{Z}^k}, \\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Such that the overall partition function is:</p>
|
||||
<p>such that the overall partition function is:</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\mathcal{Z} &= \sum_k e^{- \beta H^k} Z^k \\
|
||||
&= \sum_k e^{-\beta (H^k + F^k)} \\
|
||||
&= \sum_k e^{-\beta (H^k + F^k)}. \\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Because fermions are limited to occupation numbers of 0 or 1, <span class="math inline">\(Z^k\)</span> simplifies nicely. If <span class="math inline">\(m^j_i = \{0,1\}\)</span> is defined as the occupation of the level with energy <span class="math inline">\(\epsilon^k_i\)</span> then the partition function is a sum over all the occupation states labelled by <span class="math inline">\(j\)</span>:</p>
|
||||
<p>Fermions are limited to occupation numbers of 0 or 1, so <span class="math inline">\(Z^k\)</span> simplifies nicely. If <span class="math inline">\(m^j_i = \{0,1\}\)</span> is defined as the occupation of the level with energy <span class="math inline">\(\epsilon^k_i\)</span> then the partition function is a sum over all the occupation states labelled by <span class="math inline">\(j\)</span>:</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
Z^k &= \mathrm{Tr} e^{-\beta F^k} = \sum_j e^{-\beta \sum_i m^j_i \epsilon^k_i}\\
|
||||
&= \sum_j \prod_i e^{- \beta m^j_i \epsilon^k_i}= \prod_i \sum_j e^{- \beta m^j_i \epsilon^k_i}\\
|
||||
&= \prod_i (1 + e^{- \beta \epsilon^k_i})\\
|
||||
F^k &= -1/\beta \sum_k \ln{(1 + e^{- \beta \epsilon^k_i})}
|
||||
F^k &= -1/\beta \sum_k \ln{(1 + e^{- \beta \epsilon^k_i})}.
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Observables can then be calculated from the partition function, for examples the occupation numbers:</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
\langle N \rangle &= \frac{1}{\beta} \frac{1}{Z} \frac{\partial Z}{\partial \mu} = - \frac{\partial F}{\partial \mu}\\
|
||||
&= \frac{1}{\beta} \frac{1}{Z} \frac{\partial}{\partial \mu} \sum_k e^{-\beta (H^k + F^k)}\\
|
||||
&= 1/Z \sum_k (N^k_{\mathrm{ion}} + N^k_{\mathrm{electron}}) e^{-\beta (H^k + F^k)}\\
|
||||
&= 1/Z \sum_k (N^k_{\mathrm{ion}} + N^k_{\mathrm{electron}}) e^{-\beta (H^k + F^k)},\\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>with the definitions:</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
N^k_{\mathrm{ion}} &= - \frac{\partial H^k}{\partial \mu} = \sum_i n^k_i\\
|
||||
N^k_{\mathrm{electron}} &= - \frac{\partial F^k}{\partial \mu} = \sum_i \left(1 + e^{\beta \epsilon^k_i}\right)^{-1}\\
|
||||
N^k_{\mathrm{electron}} &= - \frac{\partial F^k}{\partial \mu} = \sum_i \left(1 + e^{\beta \epsilon^k_i}\right)^{-1}.\\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>Next Section: <a href="../6_Appendices/A.1_Particle_Hole_Symmetry.html">Particle-Hole Symmetry</a></p>
|
||||
</section>
|
||||
|
@ -66,16 +66,16 @@ image:
|
||||
<section id="particle-hole-symmetry" class="level1">
|
||||
<h1>Particle-Hole Symmetry</h1>
|
||||
<p>The Hubbard and FK models on a bipartite lattice have particle-hole (PH) symmetry <span class="math inline">\(\mathcal{P}^\dagger H \mathcal{P} = - H\)</span>, accordingly they have symmetric energy spectra. The associated symmetry operator <span class="math inline">\(\mathcal{P}\)</span> exchanges creation and annihilation operators along with a sign change between the two sublattices. In the language of the Hubbard model of electrons <span class="math inline">\(c_{\alpha,i}\)</span> with spin <span class="math inline">\(\alpha\)</span> at site <span class="math inline">\(i\)</span> the particle hole operator corresponds to the substitution of new fermion operators <span class="math inline">\(d^\dagger_{\alpha,i}\)</span> and number operators <span class="math inline">\(m_{\alpha,i}\)</span> where</p>
|
||||
<p><span class="math display">\[d^\dagger_{\alpha,i} = \epsilon_i c_{\alpha,i}\]</span> <span class="math display">\[m_{\alpha,i} = d^\dagger_{\alpha,i}d_{\alpha,i}\]</span></p>
|
||||
<p><span class="math display">\[d^\dagger_{\alpha,i} = \epsilon_i c_{\alpha,i}\]</span> <span class="math display">\[m_{\alpha,i} = d^\dagger_{\alpha,i}d_{\alpha,i},\]</span></p>
|
||||
<p>the lattices must be bipartite because to make this work we set <span class="math inline">\(\epsilon_i = +1\)</span> for the A sublattice and <span class="math inline">\(-1\)</span> for the even sublattice <span class="citation" data-cites="gruberFalicovKimballModel2006"> [<a href="#ref-gruberFalicovKimballModel2006" role="doc-biblioref">1</a>]</span>.</p>
|
||||
<p>The entirely filled state <span class="math inline">\(\ket{\Omega} = \sum_{\alpha,i} c^\dagger_{\alpha,i} \ket{0}\)</span> becomes the new vacuum state <span class="math display">\[d_{i\sigma} \ket{\Omega} = (-1)^i c^\dagger_{i\sigma} \sum_{j\rho} c^\dagger_{j\rho} \ket{0} = 0.\]</span></p>
|
||||
<p>The number operator <span class="math inline">\(m_{\alpha,i} = 0,1\)</span> now counts holes rather than electrons <span class="math display">\[ m_{\alpha,i} = c_{\alpha,i} c^\dagger_{\alpha,i} = 1 - c^\dagger_{\alpha,i} c_{\alpha,i}.\]</span></p>
|
||||
<p>In the case of nearest neighbour hopping on a bipartite lattice this transformation also leaves the hopping term unchanged because <span class="math inline">\(\epsilon_i \epsilon_j = -1\)</span> when <span class="math inline">\(i\)</span> and <span class="math inline">\(j\)</span> are on different sublattices: <span class="math display">\[ d^\dagger_{\alpha,i} d_{\alpha,j} = \epsilon_i \epsilon_j c_{\alpha,i} c^\dagger_{\alpha,j} = c^\dagger_{\alpha,i} c_{\alpha,j} \]</span></p>
|
||||
<p>The number operator <span class="math inline">\(m_{\alpha,i} = 0,1\)</span> now counts holes rather than electrons <span class="math display">\[ m_{\alpha,i} = c^{\phantom{\dagger}}_{\alpha,i} c^\dagger_{\alpha,i} = 1 - c^\dagger_{\alpha,i} c^{\phantom{\dagger}}_{\alpha,i}.\]</span></p>
|
||||
<p>In the case of nearest neighbour hopping on a bipartite lattice this transformation also leaves the hopping term unchanged because <span class="math inline">\(\epsilon_i \epsilon_j = -1\)</span> when <span class="math inline">\(i\)</span> and <span class="math inline">\(j\)</span> are on different sublattices: <span class="math display">\[ d^\dagger_{\alpha,i} d_{\alpha,j} = \epsilon_i \epsilon_j c^{\phantom{\dagger}}_{\alpha,i} c^\dagger_{\alpha,j} = c^\dagger_{\alpha,i} c^{\phantom{\dagger}}_{\alpha,j}. \]</span></p>
|
||||
<p>Defining the particle density <span class="math inline">\(\rho\)</span> as the number of fermions per site: <span class="math display">\[
|
||||
\rho = \frac{1}{N} \sum_i \left( n_{i \uparrow} + n_{i \downarrow} \right)
|
||||
\rho = \frac{1}{N} \sum_i \left( n_{i \uparrow} + n_{i \downarrow} \right).
|
||||
\]</span></p>
|
||||
<p>The PH symmetry maps the Hamiltonian to itself with the sign of the chemical potential reversed and the density inverted about half filling: <span class="math display">\[ \text{PH} : H(t, U, \mu) \rightarrow H(t, U, -\mu) \]</span> <span class="math display">\[ \rho \rightarrow 2 - \rho \]</span></p>
|
||||
<p>The Hamiltonian is symmetric under PH at <span class="math inline">\(\mu = 0\)</span> and so must all the observables, hence half filling <span class="math inline">\(\rho = 1\)</span> occurs here. This symmetry and known observable acts as a useful test for the numerical calculations.</p>
|
||||
<p>The PH symmetry maps the Hamiltonian to itself with the sign of the chemical potential reversed and the density inverted about half-filling: <span class="math display">\[ \text{PH} : H(t, U, \mu) \rightarrow H(t, U, -\mu) \]</span> <span class="math display">\[ \rho \rightarrow 2 - \rho. \]</span></p>
|
||||
<p>The Hamiltonian is symmetric under PH at <span class="math inline">\(\mu = 0\)</span> and so must all the observables, hence half-filling <span class="math inline">\(\rho = 1\)</span> occurs here. This symmetry and known observable acts as a useful test for the numerical calculations.</p>
|
||||
<p>Next Section: <a href="../6_Appendices/A.2_Markov_Chain_Monte_Carlo.html">Markov Chain Monte Carlo</a></p>
|
||||
</section>
|
||||
<section id="bibliography" class="level1 unnumbered">
|
||||
|
@ -92,21 +92,21 @@ image:
|
||||
<p>Markov Chain Monte Carlo (MCMC) is a useful method whenever we have a probability distribution that we want to sample from but there is not direct sampling way to do so.</p>
|
||||
<section id="direct-random-sampling" class="level2">
|
||||
<h2>Direct Random Sampling</h2>
|
||||
<p>In almost any computer simulation the ultimate source of randomness is a stream of (close to) uniform, uncorrelated bits generated from a pseudo random number generator. A direct sampling method takes such a source and outputs uncorrelated samples from the target distribution. The fact they’re uncorrelated is key as we’ll see later. Examples of direct sampling methods range from the trivial: take n random bits to generate integers uniformly between 0 and <span class="math inline">\(2^n\)</span> to more complex methods such as inverse transform sampling and rejection sampling <span class="citation" data-cites="devroyeRandomSampling1986"> [<a href="#ref-devroyeRandomSampling1986" role="doc-biblioref">1</a>]</span>.</p>
|
||||
<p>In almost any computer simulation the ultimate source of randomness is a stream of (close to) uniform, uncorrelated bits generated from a pseudo random number generator. A direct sampling method takes such a source and outputs uncorrelated samples from the target distribution. The fact they are uncorrelated is key as we’ll see later. Examples of direct sampling methods range from the trivial: take n random bits to generate integers uniformly between 0 and <span class="math inline">\(2^n\)</span> to more complex methods such as inverse transform sampling and rejection sampling <span class="citation" data-cites="devroyeRandomSampling1986"> [<a href="#ref-devroyeRandomSampling1986" role="doc-biblioref">1</a>]</span>.</p>
|
||||
<p>In physics the distribution we usually want to sample from is the Boltzmann probability over states of the system <span class="math inline">\(S\)</span>: <span class="math display">\[
|
||||
\begin{aligned}
|
||||
p(S) &= \frac{1}{\mathcal{Z}} e^{-\beta H(S)} \\
|
||||
p(S) &= \frac{1}{\mathcal{Z}} e^{-\beta H(S)}, \\
|
||||
\end{aligned}
|
||||
\]</span> where <span class="math inline">\(\mathcal{Z} = \sum_S e^{-\beta H(S)}\)</span> is the normalisation factor and ubiquitous partition function. In principle we could directly sample from this, for a discrete system there are finitely many choices. We could calculate the probability of each one and assign each a region of the unit interval which we could then sample uniformly from.</p>
|
||||
<p>However if we actually try to do this we will run into two problems, we can’t calculate <span class="math inline">\(\mathcal{Z}\)</span> for any reasonably sized systems because the state space grows exponentially with system size. Even if we could calculate <span class="math inline">\(\mathcal{Z}\)</span>, sampling from an exponentially large number of options quickly become tricky. This kind of problem happens in many other disciplines too, particularly when fitting statistical models using Bayesian inference <span class="citation" data-cites="BMCP2021"> [<a href="#ref-BMCP2021" role="doc-biblioref">2</a>]</span>.</p>
|
||||
<p>However, if we actually try to do this we will run into two problems, we can’t calculate <span class="math inline">\(\mathcal{Z}\)</span> for any reasonably sized systems because the state space grows exponentially with system size. Even if we could calculate <span class="math inline">\(\mathcal{Z}\)</span>, sampling from an exponentially large number of options quickly become tricky. This kind of problem happens in many other disciplines too, particularly when fitting statistical models using Bayesian inference <span class="citation" data-cites="BMCP2021"> [<a href="#ref-BMCP2021" role="doc-biblioref">2</a>]</span>.</p>
|
||||
</section>
|
||||
<section id="mcmc-sampling" class="level2">
|
||||
<h2>MCMC Sampling</h2>
|
||||
<p>So what can we do? Well it turns out that if we’re willing to give up in the requirement that the samples be uncorrelated then we can use MCMC instead.</p>
|
||||
<p>So what can we do? Well it turns out that if we are willing to give up in the requirement that the samples be uncorrelated then we can use MCMC instead.</p>
|
||||
<p>MCMC defines a weighted random walk over the states <span class="math inline">\((S_0, S_1, S_2, ...)\)</span>, such that in the long time limit, states are visited according to their probability <span class="math inline">\(p(S)\)</span>. <span class="citation" data-cites="binderGuidePracticalWork1988 kerteszAdvancesComputerSimulation1998 wolffMonteCarloErrors2004"> [<a href="#ref-binderGuidePracticalWork1988" role="doc-biblioref">3</a>–<a href="#ref-wolffMonteCarloErrors2004" role="doc-biblioref">5</a>]</span>. <span class="citation" data-cites="krauthIntroductionMonteCarlo1998"> [<a href="#ref-krauthIntroductionMonteCarlo1998" role="doc-biblioref">6</a>]</span></p>
|
||||
<p><span class="math display">\[\lim_{i\to\infty} p(S_i) = P(S)\]</span></p>
|
||||
<p><span class="math display">\[\lim_{i\to\infty} p(S_i) = P(S).\]</span></p>
|
||||
<p>In a physics context this lets us evaluate any observable with a mean over the states visited by the walk. <span class="math display">\[\begin{aligned}
|
||||
\langle O \rangle & = \sum_{S} p(S) \langle O \rangle_{S} = \sum_{i = 0}^{M} \langle O\rangle_{S_i} + \mathcal{O}(\tfrac{1}{\sqrt{M}})\\
|
||||
\langle O \rangle & = \sum_{S} p(S) \langle O \rangle_{S} = \sum_{i = 0}^{M} \langle O\rangle_{S_i} + \mathcal{O}(\tfrac{1}{\sqrt{M}}).\\
|
||||
\end{aligned}\]</span></p>
|
||||
<p>The samples in the random walk are correlated so the samples effectively contain less information than <span class="math inline">\(N\)</span> independent samples would. As a consequence the variance is larger than the <span class="math inline">\(\langle O^2 \rangle - \langle O\rangle^2\)</span> form it would have if the estimates were uncorrelated. Methods of estimating the true variance of <span class="math inline">\(\langle O \rangle\)</span> and decided how many steps are needed will be considered later.</p>
|
||||
</section>
|
||||
@ -128,17 +128,17 @@ p(S) &= \frac{1}{\mathcal{Z}} e^{-\beta H(S)} \\
|
||||
<p>We can quite easily write down the properties that <span class="math inline">\(\mathcal{T}\)</span> must have in order to yield the correct target distribution. Since we must transition somewhere at each step, we first have the normalisation condition that <span class="math display">\[\sum\limits_S \mathcal{T}(S' \rightarrow S) = 1.\]</span></p>
|
||||
<p>Second, let us move to an ensemble view, where rather than individual walkers and states, we think about the probability distribution of many walkers at each step. If we start all the walkers in the same place the initial distribution will be a delta function and as we step the walkers will wander around, giving us a sequence of probability distributions <span class="math inline">\(\{p_0(S), p_1(S), p_2(S)\ldots\}\)</span>. For discrete spaces we can write the action of the transition function on <span class="math inline">\(p_i\)</span> as a matrix equation</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
p_{i+1}(S) &= \sum_{S' \in \{S\}} p_i(S') \mathcal{T}(S' \rightarrow S)
|
||||
p_{i+1}(S) &= \sum_{S' \in \{S\}} p_i(S') \mathcal{T}(S' \rightarrow S).
|
||||
\end{aligned}\]</span></p>
|
||||
<p>This equation is essentially just stating that total probability mass is conserved as our walkers flow around the state space.</p>
|
||||
<p>In order that <span class="math inline">\(p_i\)</span> converges to our target distribution <span class="math inline">\(p\)</span> in the long time limit, we need the target distribution to be a fixed point of the transition function</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
P(S) &= \sum_{S'} P(S') \mathcal{T}(S' \rightarrow S)
|
||||
P(S) &= \sum_{S'} P(S') \mathcal{T}(S' \rightarrow S).
|
||||
\end{aligned}
|
||||
\]</span> Along with some more technical considerations such as ergodicity which won’t be considered here, global balance suffices to ensure that a MCMC method is correct <span class="citation" data-cites="kellyReversibilityStochasticNetworks1981"> [<a href="#ref-kellyReversibilityStochasticNetworks1981" role="doc-biblioref">7</a>]</span>.</p>
|
||||
<p>A sufficient but not necessary condition for global balance to hold is called detailed balance:</p>
|
||||
<p><span class="math display">\[
|
||||
P(S) \mathcal{T}(S \rightarrow S') = P(S') \mathcal{T}(S' \rightarrow S)
|
||||
P(S) \mathcal{T}(S \rightarrow S') = P(S') \mathcal{T}(S' \rightarrow S).
|
||||
\]</span></p>
|
||||
<p>In practice most algorithms are constructed to satisfy detailed rather than global balance, though there are arguments that the relaxed requirements of global balance can lead to faster algorithms <span class="citation" data-cites="kapferSamplingPolytopeHarddisk2013"> [<a href="#ref-kapferSamplingPolytopeHarddisk2013" role="doc-biblioref">8</a>]</span>.</p>
|
||||
<p>The goal of MCMC is then to choose <span class="math inline">\(\mathcal{T}\)</span> so that it has the desired thermal distribution <span class="math inline">\(P(S)\)</span> as its fixed point and converges quickly onto it. This boils down to requiring that the matrix representation of <span class="math inline">\(T_{ij} = \mathcal{T}(S_i \to S_j)\)</span> has an eigenvector with entries <span class="math inline">\(P_i = P(S_i)\)</span> with eigenvalue 1 and all other eigenvalues with magnitude less than one. The convergence time depends on the magnitude of the second largest eigenvalue.</p>
|
||||
@ -146,33 +146,33 @@ P(S) \mathcal{T}(S \rightarrow S') = P(S') \mathcal{T}(S' \rightarro
|
||||
</section>
|
||||
<section id="the-metropolis-hastings-algorithm" class="level2">
|
||||
<h2>The Metropolis-Hastings Algorithm</h2>
|
||||
<p>The Metropolis-Hastings algorithm breaks the transition function into a proposal distribution <span class="math inline">\(q(S \to S')\)</span> and an acceptance function <span class="math inline">\(\mathcal{A}(S \to S')\)</span>. <span class="math inline">\(q\)</span> must be a function we can directly sample from, and in many cases takes the form of flipping some number of spins in <span class="math inline">\(S\)</span>, i.e if we’re flipping a single random spin in the spin chain, <span class="math inline">\(q(S \to S')\)</span> is the uniform distribution on states reachable by one spin flip from <span class="math inline">\(S\)</span>. This also gives the symmetry property that <span class="math inline">\(q(S \to S') = q(S' \to S)\)</span>.</p>
|
||||
<p>The Metropolis-Hastings algorithm breaks the transition function into a proposal distribution <span class="math inline">\(q(S \to S')\)</span> and an acceptance function <span class="math inline">\(\mathcal{A}(S \to S')\)</span>. <span class="math inline">\(q\)</span> must be a function we can directly sample from, and in many cases takes the form of flipping some number of spins in <span class="math inline">\(S\)</span>, i.e., if we are flipping a single random spin in the spin chain, <span class="math inline">\(q(S \to S')\)</span> is the uniform distribution on states reachable by one spin flip from <span class="math inline">\(S\)</span>. This also gives the symmetry property that <span class="math inline">\(q(S \to S') = q(S' \to S)\)</span>.</p>
|
||||
<p>The proposal <span class="math inline">\(S'\)</span> is then accepted or rejected with an acceptance probability <span class="math inline">\(\mathcal{A}(S \to S')\)</span>, if the proposal is rejected then <span class="math inline">\(S_{i+1} = S_{i}\)</span>. Hence:</p>
|
||||
<p><span class="math display">\[\mathcal{T}(S\to S') = q(S\to S')\mathcal{A}(S \to S')\]</span></p>
|
||||
<p><span class="math display">\[\mathcal{T}(S\to S') = q(S\to S')\mathcal{A}(S \to S').\]</span></p>
|
||||
<p>The Metropolis-Hasting algorithm is a slight extension of the original Metropolis algorithm which allows for non-symmetric proposal distributions <span class="math inline">\(q(S\to S') \neq q(S'\to S)\)</span>. It can be derived starting from detailed balance <span class="citation" data-cites="krauthIntroductionMonteCarlo1998"> [<a href="#ref-krauthIntroductionMonteCarlo1998" role="doc-biblioref">6</a>]</span>:</p>
|
||||
<p><span class="math display">\[
|
||||
P(S)\mathcal{T}(S \to S') = P(S')\mathcal{T}(S' \to S)
|
||||
P(S)\mathcal{T}(S \to S') = P(S')\mathcal{T}(S' \to S),
|
||||
\]</span></p>
|
||||
<p>inserting the proposal and acceptance function</p>
|
||||
<p><span class="math display">\[
|
||||
P(S)q(S \to S')\mathcal{A}(S \to S') = P(S')q(S' \to S)\mathcal{A}(S' \to S)
|
||||
P(S)q(S \to S')\mathcal{A}(S \to S') = P(S')q(S' \to S)\mathcal{A}(S' \to S),
|
||||
\]</span></p>
|
||||
<p>rearranging gives us a condition on the acceptance function in terms of the target distribution and the proposal distribution which can be thought of as inputs to the algorithm</p>
|
||||
<p><span class="math display">\[
|
||||
\frac{\mathcal{A}(S \to S')}{\mathcal{A}(S' \to S)} = \frac{P(S')q(S' \to S)}{P(S)q(S \to S')} = f(S, S')
|
||||
\frac{\mathcal{A}(S \to S')}{\mathcal{A}(S' \to S)} = \frac{P(S')q(S' \to S)}{P(S)q(S \to S')} = f(S, S').
|
||||
\]</span></p>
|
||||
<p>The Metropolis-Hastings algorithm is the choice</p>
|
||||
<p><span class="math display">\[
|
||||
\begin{aligned}
|
||||
\label{eq:mh}
|
||||
\mathcal{A}(S \to S') = \min\left(1, f(S,S')\right)
|
||||
\mathcal{A}(S \to S') = \min\left(1, f(S,S')\right),
|
||||
\end{aligned}
|
||||
\]</span> for the acceptance function. The proposal distribution is left as a free choice.</p>
|
||||
<p>Noting that <span class="math inline">\(f(S,S') = 1/f(S',S)\)</span>, we can see that the MH algorithm satifies detailed balance by considering the two cases <span class="math inline">\(f(S,S') > 1\)</span> and <span class="math inline">\(f(S,S') < 1\)</span>.</p>
|
||||
<p>Noting that <span class="math inline">\(f(S,S') = 1/f(S',S)\)</span>, we can see that the MH algorithm satisfies detailed balance by considering the two cases <span class="math inline">\(f(S,S') > 1\)</span> and <span class="math inline">\(f(S,S') < 1\)</span>.</p>
|
||||
<p>By choosing the proposal distribution such that <span class="math inline">\(f(S,S')\)</span> is as close as possible to one, the rate of rejections can be reduced and the algorithm sped up. This can be challenging though, as getting <span class="math inline">\(f(S,S')\)</span> close to 1 would imply that we can directly sample from a distribution very close to the target distribution. As MCMC is usually applied to problems for which there is virtually no hope of sampling directly from the target distribution, it’s rare that one can do so approximately.</p>
|
||||
<p>When the proposal distribution is symmetric as ours is, it cancels out in the expression for the acceptance function and the Metropolis-Hastings algorithm is simply the choice</p>
|
||||
<p><span class="math display">\[
|
||||
\mathcal{A}(S \to S') = \min\left(1, e^{-\beta\;\Delta F}\right)
|
||||
\mathcal{A}(S \to S') = \min\left(1, e^{-\beta\;\Delta F}\right),
|
||||
\]</span></p>
|
||||
<p>where <span class="math inline">\(F\)</span> is the overall free energy of the system, including both the quantum and classical sector.</p>
|
||||
<p>To implement the acceptance function in practice we pick a random number in the unit interval and accept if it is less than <span class="math inline">\(e^{-\beta\;\Delta F}\)</span>:</p>
|
||||
@ -207,11 +207,11 @@ P(S)q(S \to S')\mathcal{A}(S \to S') = P(S')q(S' \to S)\mathcal{
|
||||
<span id="cb3-14"><a href="#cb3-14" aria-hidden="true" tabindex="-1"></a> states[i] <span class="op">=</span> current_state</span>
|
||||
<span id="cb3-15"><a href="#cb3-15" aria-hidden="true" tabindex="-1"></a> </span></code></pre></div>
|
||||
<p>As discussed in the main text, for the model parameters used, we find that with our new scheme the matrix diagonalisation is skipped around 30% of the time at <span class="math inline">\(T = 2.5\)</span> and up to 80% at <span class="math inline">\(T = 1.5\)</span>. We observe that for <span class="math inline">\(N = 50\)</span>, the matrix diagonalisation, if it occurs, occupies around 60% of the total computation time for a single step. This rises to 90% at N = 300 and further increases for larger N. We therefore get the greatest speedup for large system sizes at low temperature where many prospective transitions are rejected at the classical stage and the matrix computation takes up the greatest fraction of the total computation time. The upshot is that we find a speedup of up to a factor of 10 at the cost of very little extra algorithmic complexity.</p>
|
||||
<p>This modified scheme has the acceptance function <span class="math display">\[\mathcal{A}(a \to b) = \min\left(1, e^{-\beta \Delta H_s}\right)\min\left(1, e^{-\beta \Delta F_c}\right)\;.\]</span></p>
|
||||
<p>This modified scheme has the acceptance function <span class="math display">\[\mathcal{A}(a \to b) = \min\left(1, e^{-\beta \Delta H_s}\right)\min\left(1, e^{-\beta \Delta F_c}\right).\]</span></p>
|
||||
<p>We can show that this satisfies the detailed balance equations as follows. Defining <span class="math inline">\(r_c = e^{-\beta H_c}\)</span> and <span class="math inline">\(r_q = e^{-\beta F_q}\)</span> our target distribution is <span class="math inline">\(\pi(a) = r_c r_q\)</span>. This method has <span class="math inline">\(\mathcal{T}(a\to b) = q(a\to b)\mathcal{A}(a \to b)\)</span> with symmetric <span class="math inline">\(p(a \to b) = \pi(b \to a)\)</span> and <span class="math inline">\(\mathcal{A} = \min\left(1, r_c\right) \min\left(1, r_q\right)\)</span></p>
|
||||
<p>Substituting this into the detailed balance equation gives: <span class="math display">\[\mathcal{T}(a \to b)/\mathcal{T}(b \to a) = \pi(b)/\pi(a) = r_c r_q\]</span></p>
|
||||
<p>Substituting this into the detailed balance equation gives: <span class="math display">\[\mathcal{T}(a \to b)/\mathcal{T}(b \to a) = \pi(b)/\pi(a) = r_c r_q.\]</span></p>
|
||||
<p>Taking the LHS and substituting in our transition function: <span class="math display">\[\begin{aligned}
|
||||
\mathcal{T}(a \to b)/\mathcal{T}(b \to a) = \frac{\min\left(1, r_c\right) \min\left(1, r_q\right)}{ \min\left(1, 1/r_c\right) \min\left(1, 1/r_q\right)}\end{aligned}\]</span></p>
|
||||
\mathcal{T}(a \to b)/\mathcal{T}(b \to a) = \frac{\min\left(1, r_c\right) \min\left(1, r_q\right)}{ \min\left(1, 1/r_c\right) \min\left(1, 1/r_q\right)},\end{aligned}\]</span></p>
|
||||
<p>which simplifies to <span class="math inline">\(r_c r_q\)</span> as <span class="math inline">\(\min(1,r)/\min(1,1/r) = r\)</span> for <span class="math inline">\(r > 0\)</span>.</p>
|
||||
<section id="app-mcmc-autocorrelation" class="level3">
|
||||
<h3>Autocorrelation Time</h3>
|
||||
@ -219,9 +219,9 @@ P(S)q(S \to S')\mathcal{A}(S \to S') = P(S')q(S' \to S)\mathcal{
|
||||
<img src="/assets/thesis/fk_chapter/lsr/figs/m_autocorr.png" id="fig-m_autocorr" data-short-caption="Autocorrelation in MCMC" style="width:100.0%" alt="Figure 1: (Upper) 10 MCMC chains starting from the same initial state for a system with N = 150 sites and 3000 MCMC steps. At each MCMC step, n spins are flipped where n is drawn from Uniform(1,N) and this is repeated N^2/100 times. The simulations therefore have the potential to necessitate 10*N^2 matrix diagonalisations for each 100 MCMC steps. (Lower) The normalised autocorrelation (\langle m_i m_{i-j}\rangle - \langle m_i\rangle \langle m_i \rangle) / Var(m_i)) averaged over i. It can be seen that even with each MCMC step already being composed of many individual flip attempts, the autocorrelation is still non negligible and must be taken into account in the statistics. t = 1, \alpha = 1.25, T = 2.2, J = U = 5" />
|
||||
<figcaption aria-hidden="true">Figure 1: (Upper) 10 MCMC chains starting from the same initial state for a system with <span class="math inline">\(N = 150\)</span> sites and 3000 MCMC steps. At each MCMC step, n spins are flipped where n is drawn from Uniform(1,N) and this is repeated <span class="math inline">\(N^2/100\)</span> times. The simulations therefore have the potential to necessitate <span class="math inline">\(10*N^2\)</span> matrix diagonalisations for each 100 MCMC steps. (Lower) The normalised autocorrelation <span class="math inline">\((\langle m_i m_{i-j}\rangle - \langle m_i\rangle \langle m_i \rangle) / Var(m_i))\)</span> averaged over <span class="math inline">\(i\)</span>. It can be seen that even with each MCMC step already being composed of many individual flip attempts, the autocorrelation is still non negligible and must be taken into account in the statistics. <span class="math inline">\(t = 1, \alpha = 1.25, T = 2.2, J = U = 5\)</span></figcaption>
|
||||
</figure>
|
||||
<p>At this stage one might think we’re done. We can indeed draw independent samples from our target Boltzmann distribution by starting from some arbitrary initial state and doing <span class="math inline">\(k\)</span> steps to arrive at a sample. These are not, however, independent samples. In fig. <a href="#fig:m_autocorr">1</a> it is already clear that the samples of the order parameter <span class="math inline">\(m\)</span> have some autocorrelation because only a few spins are flipped each step. Even when the number of spins flipped per step is increased that it can be an important effect near the phase transition. Let’s define the autocorrelation time <span class="math inline">\(\tau(O)\)</span> informally as the number of MCMC samples of some observable O that are statistically equal to one independent sample or equivalently as the number of MCMC steps after which the samples are correlated below some cut-off, see <span class="citation" data-cites="krauthIntroductionMonteCarlo1996"> [<a href="#ref-krauthIntroductionMonteCarlo1996" role="doc-biblioref">9</a>]</span>. The autocorrelation time is generally shorter than the convergence time so it therefore makes sense from an efficiency standpoint to run a single walker for many MCMC steps rather than to run a huge ensemble for <span class="math inline">\(k\)</span> steps each.</p>
|
||||
<p>At this stage one might think we are done. We can indeed draw independent samples from our target Boltzmann distribution by starting from some arbitrary initial state and doing <span class="math inline">\(k\)</span> steps to arrive at a sample. These are not, however, independent samples. In fig. <a href="#fig:m_autocorr">1</a> it is already clear that the samples of the order parameter <span class="math inline">\(m\)</span> have some autocorrelation because only a few spins are flipped each step. Even when the number of spins flipped per step is increased that it can be an important effect near the phase transition. Let’s define the autocorrelation time <span class="math inline">\(\tau(O)\)</span> informally as the number of MCMC samples of some observable O that are statistically equal to one independent sample or equivalently as the number of MCMC steps after which the samples are correlated below some cut-off, see ref. <span class="citation" data-cites="krauthIntroductionMonteCarlo1996"> [<a href="#ref-krauthIntroductionMonteCarlo1996" role="doc-biblioref">9</a>]</span>. The autocorrelation time is generally shorter than the convergence time so it therefore makes sense from an efficiency standpoint to run a single walker for many MCMC steps rather than to run a huge ensemble for <span class="math inline">\(k\)</span> steps each.</p>
|
||||
<p>Once the random walk has been carried out for many steps, the expectation values of <span class="math inline">\(O\)</span> can be estimated from the MCMC samples <span class="math inline">\(S_i\)</span>: <span class="math display">\[
|
||||
\langle O \rangle = \sum_{i = 0}^{N} O(S_i) + \mathcal{O}(\frac{1}{\sqrt{N}})
|
||||
\langle O \rangle = \sum_{i = 0}^{N} O(S_i) + \mathcal{O}(\frac{1}{\sqrt{N}}).
|
||||
\]</span></p>
|
||||
<p>The samples are correlated so the N of them effectively contains less information than <span class="math inline">\(N\)</span> independent samples would, in fact roughly <span class="math inline">\(N/\tau\)</span> effective samples. As a consequence the variance is larger than the <span class="math inline">\(\langle O^2 \rangle - \langle O \rangle ^2\)</span> form it would have if the estimates were uncorrelated. There are many methods in the literature for estimating the true variance of <span class="math inline">\(\langle O \rangle\)</span> and deciding how many steps are needed but my approach has been to run a small number of parallel chains, which are independent, in order to estimate the statistical error produced. This is a slightly less computationally efficient because it requires throwing away those <span class="math inline">\(k\)</span> steps generated before convergence multiple times but it is conceptually simple.</p>
|
||||
</section>
|
||||
@ -239,7 +239,7 @@ P(S)q(S \to S')\mathcal{A}(S \to S') = P(S')q(S' \to S)\mathcal{
|
||||
<li>Choosing n from Uniform(1, N) and then flipping n sites for some fixed N.</li>
|
||||
<li>Attempting to tune the proposal distribution for each parameter regime.</li>
|
||||
</ol>
|
||||
<p>Fro fig. <a href="#fig:autocorr_multiple_proposals">2</a> we see that even at moderately high temperatures <span class="math inline">\(T > T_c\)</span> flipping one or two sites is the best choice. However for some simulations at very high temperature flipping more spins is warranted.</p>
|
||||
<p>Fro fig. <a href="#fig:autocorr_multiple_proposals">2</a> we see that even at moderately high temperatures <span class="math inline">\(T > T_c\)</span> flipping one or two sites is the best choice. However, for some simulations at very high temperature flipping more spins is warranted.</p>
|
||||
<p>Next Section: <a href="../6_Appendices/A.3_Lattice_Generation.html">Lattice Generation</a></p>
|
||||
</section>
|
||||
</section>
|
||||
|
@ -78,7 +78,7 @@ image:
|
||||
<p>Three key pieces of information allow us to represent amorphous lattices. The majority of the graph connectivity is encoded by an ordered list of edges <span class="math inline">\((i,j)\)</span>. These are ordered to represent both directed and undirected graphs. This is useful for defining the sign of bond operators <span class="math inline">\(u_{ij} = - u_{ji}\)</span>.</p>
|
||||
<p>Information about the embedding of the lattice onto the torus is encoded into a point on the unit square associated with each vertex. The torus is unwrapped onto the square by defining an arbitrary pair of cuts along the major and minor axes. For simplicity, we take these axes to be the lines <span class="math inline">\(x = 0\)</span> and <span class="math inline">\(y = 0\)</span>. We can wrap the unit square back up into a torus by identifying the lines <span class="math inline">\(x = 0\)</span> with <span class="math inline">\(x = 1\)</span> and <span class="math inline">\(y = 0\)</span> with <span class="math inline">\(y = 1\)</span>.</p>
|
||||
<p>Finally, we need to encode the topology of the graph. This is necessary because, if we are simply given an edge <span class="math inline">\((i, j)\)</span> we do not know how the edge gets from vertex i to vertex j. One method would be taking the shortest path, but it could also ‘go the long way around’ by crossing one of the cuts. To encode this information, we store an additional vector <span class="math inline">\(\vec{r}\)</span> associated with each edge. <span class="math inline">\(r_i^x = 0\)</span> means that edge i does not cross the x. <span class="math inline">\(r_i^x = +1\)</span> (<span class="math inline">\(-1\)</span>) means it crossed the cut in a positive (negative) sense.</p>
|
||||
<p>This description of the lattice has a very nice relationship to Bloch’s theorem. Applying Bloch’s theorem to a periodic lattice essentially means wrappping the unit cell onto a torus. Variations that happen at longer length scales than the size of the unit cell are captured by the crystal momentum. The crystal momentum inserts a phase factor <span class="math inline">\(e^{i \vec{q}\cdot\vec{r}}\)</span> onto bonds that cross to adjacent unit cells. The vector <span class="math inline">\(\vec{r}\)</span> is exactly what we use to encode the topology of our lattices.</p>
|
||||
<p>This description of the lattice has a very nice relationship to Bloch’s theorem. Applying Bloch’s theorem to a periodic lattice essentially means wrapping the unit cell onto a torus. Variations that happen at longer length scales than the size of the unit cell are captured by the crystal momentum. The crystal momentum inserts a phase factor <span class="math inline">\(e^{i \vec{q}\cdot\vec{r}}\)</span> onto bonds that cross to adjacent unit cells. The vector <span class="math inline">\(\vec{r}\)</span> is exactly what we use to encode the topology of our lattices.</p>
|
||||
<figure>
|
||||
<img src="/assets/thesis/amk_chapter/methods/bloch.png" id="fig-bloch" data-short-caption="Bloch's Theorem and the Torus" style="width:100.0%" alt="Figure 1: Bloch’s theorem can be thought of as transforming from a periodic Hamiltonian on the plane to the unit cell defined on a torus. In addition we get some phase factors e^{i\vec{k}\cdot\vec{r}} associated with bonds that cross unit cells that depend on the sense in which they do so \vec{r} = (\pm1, \pm1). Representing graphs on the torus turns out to require a similar idea, we unwrap the torus to one unit cell and keep track of which bonds cross the cell boundaries." />
|
||||
<figcaption aria-hidden="true">Figure 1: Bloch’s theorem can be thought of as transforming from a periodic Hamiltonian on the plane to the unit cell defined on a torus. In addition we get some phase factors <span class="math inline">\(e^{i\vec{k}\cdot\vec{r}}\)</span> associated with bonds that cross unit cells that depend on the sense in which they do so <span class="math inline">\(\vec{r} = (\pm1, \pm1)\)</span>. Representing graphs on the torus turns out to require a similar idea, we unwrap the torus to one unit cell and keep track of which bonds cross the cell boundaries.</figcaption>
|
||||
@ -91,7 +91,7 @@ image:
|
||||
<figcaption aria-hidden="true">Figure 2: The proportion of computation time taken up by the four longest running steps when generating a lattice. For larger systems, the time taken to perform the diagonalisation dominates.</figcaption>
|
||||
</figure>
|
||||
<p>In the main text we discuss the problem of three-edge-colouring, assigning one of three labels to each edge of a graph such that no edges of the same label meet at a vertex. To solve this in practice I use a solver called <code>MiniSAT</code> <span class="citation" data-cites="imms-sat18"> [<a href="#ref-imms-sat18" role="doc-biblioref">1</a>]</span>. Like most modern SAT solvers, <code>MiniSAT</code> requires the input problem to be specified in Conjunctive Normal Form (CNF). CNF requires that the constraints be encoded as a set of <em>clauses</em> of the form</p>
|
||||
<p><span class="math display">\[x_1 \;\textrm{or}\; -x_3 \;\textrm{or}\; x_5\]</span></p>
|
||||
<p><span class="math display">\[x_1 \;\textrm{or}\; -x_3 \;\textrm{or}\; x_5,\]</span></p>
|
||||
<p>that contain logical ORs of some subset of the variables where any of the variables may also be logically NOT’d, which we represent by negation here. A solution of the problem is one that makes all the clauses simultaneously true.</p>
|
||||
<p>I encode the edge colouring problem by assigning <span class="math inline">\(3B\)</span> boolean variables to each of the <span class="math inline">\(B\)</span> edges of the graph, <span class="math inline">\(x_{i\alpha}\)</span> where <span class="math inline">\(x_{i\alpha} = 1\)</span> indicates that edge <span class="math inline">\(i\)</span> has colour <span class="math inline">\(\alpha\)</span>. For edge colouring graphs we need two types of constraints: 1. Each edge is exactly one colour. 2. No neighbouring edges are the same colour.</p>
|
||||
<p>The first constraint is a product of doing this mapping to boolean variables. The solver does not know anything about the structure of the problem unless it is encoded into the variables. Let’s say we have three variables that correspond to particular edge being red <span class="math inline">\(r\)</span>, green <span class="math inline">\(g\)</span> or blue <span class="math inline">\(b\)</span>. To require that exactly one of the variables be true, we can enforce that no pair of variables be true: <code>-(r and b) -(r and g) -(b and g)</code></p>
|
||||
|
@ -74,9 +74,9 @@ image:
|
||||
<p>A four-face-colouring can be converted into a three-edge-colouring quite easily: 1. Assume the faces of G can be four-coloured with labels (0,1,2,3) 2. Label each edge of G according to <span class="math inline">\(i + j \;\textrm{mod}\; 3\)</span> where i and j are the labels of the face adjacent to that edge. For each edge label there are two face label pairs that do not share any face labels. i,e the edge label <span class="math inline">\(0\)</span> can come about either from faces <span class="math inline">\(0 + 3\)</span> or <span class="math inline">\(1 + 2\)</span>.</p>
|
||||
<p>Explicitly, the mapping from face labels to edge labels is:</p>
|
||||
<p><span class="math display">\[\begin{aligned}
|
||||
0 + 3 \;\mathrm{or}\; 1 + 2 &= 0 \;\mathrm{mod}\; 3\\
|
||||
0 + 1 \;\mathrm{or}\; 2 + 3 &= 1 \;\mathrm{mod}\; 3\\
|
||||
0 + 2 \;\mathrm{or}\;1 + 3 &= 2 \;\mathrm{mod}\; 3\\
|
||||
0 + 3 \;\mathrm{or}\; 1 + 2 &= 0 \;\mathrm{mod}\; 3,\\
|
||||
0 + 1 \;\mathrm{or}\; 2 + 3 &= 1 \;\mathrm{mod}\; 3,\\
|
||||
0 + 2 \;\mathrm{or}\;1 + 3 &= 2 \;\mathrm{mod}\; 3.\\
|
||||
\end{aligned}
|
||||
\]</span></p>
|
||||
<ol start="3" type="1">
|
||||
|
@ -71,19 +71,19 @@ image:
|
||||
<figcaption>The relationship between the different Hilbert spaces used in the solution. __needs updating__ </figcaption>
|
||||
</figure> -->
|
||||
<p>The physical states are defined as those for which <span class="math inline">\(D_i |\phi\rangle = |\phi\rangle\)</span> for all <span class="math inline">\(D_i\)</span>. Since <span class="math inline">\(D_i\)</span> has eigenvalues <span class="math inline">\(\pm1\)</span>, the quantity <span class="math inline">\(\tfrac{(1+D_i)}{2}\)</span> has eigenvalue <span class="math inline">\(1\)</span> for physical states and <span class="math inline">\(0\)</span> for extended states so is the local projector onto the physical subspace.</p>
|
||||
<p>Therefore, the global projector is <span class="math display">\[ \mathcal{P} = \prod_{i=1}^{2N} \left( \frac{1 + D_i}{2}\right)\]</span></p>
|
||||
<p>for a toroidal trivalent lattice with <span class="math inline">\(N\)</span> plaquettes <span class="math inline">\(2N\)</span> vertices and <span class="math inline">\(3N\)</span> edges. As discussed earlier, the product over <span class="math inline">\((1 + D_j)\)</span> can also be thought of as the sum of all possible subsets <span class="math inline">\(\{i\}\)</span> of the <span class="math inline">\(D_j\)</span> operators, which is the set of all possible gauge symmetry operations.</p>
|
||||
<p><span class="math display">\[ \mathcal{P} = \frac{1}{2^{2N}} \sum_{\{i\}} \prod_{i\in\{i\}} D_i\]</span></p>
|
||||
<p>Therefore, the global projector is <span class="math display">\[ \mathcal{P} = \prod_{i=1}^{2N} \left( \frac{1 + D_i}{2}\right),\]</span></p>
|
||||
<p>for a toroidal trivalent lattice with <span class="math inline">\(N\)</span> plaquettes <span class="math inline">\(2N\)</span> vertices and <span class="math inline">\(3N\)</span> edges. As discussed earlier, the product over <span class="math inline">\((1 + D_j)\)</span> can also be thought of as the sum of all possible subsets <span class="math inline">\(\{i\}\)</span> of the <span class="math inline">\(D_j\)</span> operators, which is the set of all possible gauge symmetry operations</p>
|
||||
<p><span class="math display">\[ \mathcal{P} = \frac{1}{2^{2N}} \sum_{\{i\}} \prod_{i\in\{i\}} D_i.\]</span></p>
|
||||
<p>Since the gauge operators <span class="math inline">\(D_j\)</span> commute and square to one, we can define the complement operator <span class="math inline">\(C = \prod_{i=1}^{2N} D_i\)</span> and see that it takes each set of <span class="math inline">\(\prod_{i \in \{i\}} D_j\)</span> operators and gives us the complement of that set. We will shortly see why <span class="math inline">\(C\)</span> is the identity in the physical subspace, as noted earlier.</p>
|
||||
<p>We use the complement operator to rewrite the projector as a sum over half the subsets of <span class="math inline">\(\{i\}\)</span> - referred to as <span class="math inline">\(\Lambda\)</span>. The complement operator deals with the other half</p>
|
||||
<p><span class="math display">\[ \mathcal{P} = \left( \frac{1}{2^{2N-1}} \sum_{\Lambda} \prod_{i\in\{i\}} D_i\right) \left(\frac{1 + \prod_i^{2N} D_i}{2}\right) = \mathcal{S} \cdot \mathcal{P}_0\]</span></p>
|
||||
<p><span class="math display">\[ \mathcal{P} = \left( \frac{1}{2^{2N-1}} \sum_{\Lambda} \prod_{i\in\{i\}} D_i\right) \left(\frac{1 + \prod_i^{2N} D_i}{2}\right) = \mathcal{S} \cdot \mathcal{P}_0.\]</span></p>
|
||||
<p>To compute <span class="math inline">\(\mathcal{P}_0\)</span>, the main quantity needed is the product of the local projectors <span class="math inline">\(D_i\)</span> <span class="math display">\[\prod_i^{2N} D_i = \prod_i^{2N} b^x_i b^y_i b^z_i c_i \]</span> for a toroidal trivalent lattice with <span class="math inline">\(N\)</span> plaquettes <span class="math inline">\(2N\)</span> vertices and <span class="math inline">\(3N\)</span> edges.</p>
|
||||
<p>First, we reorder the operators by bond type. This does not require any information about the underlying lattice.</p>
|
||||
<p><span class="math display">\[\prod_i^{2N} D_i = \prod_i^{2N} b^x_i \prod_i^{2N} b^y_i \prod_i^{2N} b^z_i \prod_i^{2N} c_i\]</span></p>
|
||||
<p>The product over <span class="math inline">\(c_i\)</span> operators reduces to a determinant of the Q matrix and the fermion parity, see <span class="citation" data-cites="pedrocchiPhysicalSolutionsKitaev2011"> [<a href="#ref-pedrocchiPhysicalSolutionsKitaev2011" role="doc-biblioref">1</a>]</span>. The only difference from the honeycomb case is that we cannot explicitly compute the factors <span class="math inline">\(p_x,p_y,p_z = \pm\;1\)</span> that arise from reordering the b operators such that pairs of vertices linked by the corresponding bonds are adjacent.</p>
|
||||
<p><span class="math display">\[\prod_i^{2N} b^\alpha_i = p_\alpha \prod_{(i,j)}b^\alpha_i b^\alpha_j\]</span></p>
|
||||
<p>First, we reorder the operators by bond type. This does not require any information about the underlying lattice,</p>
|
||||
<p><span class="math display">\[\prod_i^{2N} D_i = \prod_i^{2N} b^x_i \prod_i^{2N} b^y_i \prod_i^{2N} b^z_i \prod_i^{2N} c_i.\]</span></p>
|
||||
<p>The product over <span class="math inline">\(c_i\)</span> operators reduces to a determinant of the Q matrix and the fermion parity, see ref. <span class="citation" data-cites="pedrocchiPhysicalSolutionsKitaev2011"> [<a href="#ref-pedrocchiPhysicalSolutionsKitaev2011" role="doc-biblioref">1</a>]</span>. The only difference from the honeycomb case is that we cannot explicitly compute the factors <span class="math inline">\(p_x,p_y,p_z = \pm\;1\)</span> that arise from reordering the b operators such that pairs of vertices linked by the corresponding bonds are adjacent,</p>
|
||||
<p><span class="math display">\[\prod_i^{2N} b^\alpha_i = p_\alpha \prod_{(i,j)}b^\alpha_i b^\alpha_j.\]</span></p>
|
||||
<p>However, they are simply the parity of the permutation from one ordering to the other and can be computed in linear time with a cycle decomposition <span class="citation" data-cites="sedgewickPermutationGenerationMethods1977"> [<a href="#ref-sedgewickPermutationGenerationMethods1977" role="doc-biblioref">2</a>]</span>.</p>
|
||||
<p>We find that <span class="math display">\[\mathcal{P}_0 = 1 + p_x\;p_y\;p_z\; \hat{\pi} \; \mathrm{det}(Q^u) \; \prod_{\{i,j\}} -iu_{ij}\]</span></p>
|
||||
<p>We find that <span class="math display">\[\mathcal{P}_0 = 1 + p_x\;p_y\;p_z\; \hat{\pi} \; \mathrm{det}(Q^u) \; \prod_{\{i,j\}} -iu_{ij},\]</span></p>
|
||||
<p>where <span class="math inline">\(p_x\;p_y\;p_z = \pm 1\)</span> are lattice structure factors and <span class="math inline">\(\mathrm{det}(Q^u)\)</span> is the determinant of the matrix mentioned earlier that maps <span class="math inline">\(c_i\)</span> operators to normal mode operators <span class="math inline">\(b'_i, b''_i\)</span>. These depend only on the lattice structure.</p>
|
||||
<p><span class="math inline">\(\hat{\pi} = \prod{i}^{N} (1 - 2\hat{n}_i)\)</span> is the parity of the particular many-body state determined by fermionic occupation numbers <span class="math inline">\(n_i\)</span>. As discussed in <span class="citation" data-cites="pedrocchiPhysicalSolutionsKitaev2011"> [<a href="#ref-pedrocchiPhysicalSolutionsKitaev2011" role="doc-biblioref">1</a>]</span>, <span class="math inline">\(\hat{\pi}\)</span> is gauge invariant in the sense that <span class="math inline">\([\hat{\pi}, D_i] = 0\)</span>.</p>
|
||||
<p>This implies that <span class="math inline">\(det(Q^u) \prod -i u_{ij}\)</span> is also a gauge invariant quantity. In translation invariant models this quantity which can be related to the parity of the number of vortex pairs in the system <span class="citation" data-cites="yaoAlgebraicSpinLiquid2009"> [<a href="#ref-yaoAlgebraicSpinLiquid2009" role="doc-biblioref">3</a>]</span>.</p>
|
||||
|
@ -1,7 +1,7 @@
|
||||
<ul>
|
||||
<li><a href="./1_Introduction/1_Intro.html">1 Introduction</a></li>
|
||||
<ul>
|
||||
<li><a href="./1_Introduction/1_Intro.html#interacting-quantum-many-body-systems">Interacting Quantum Many Body Systems</a></li>
|
||||
<li><a href="./1_Introduction/1_Intro.html#interacting-quantum-many-body-systems">Interacting Quantum Many-Body Systems</a></li>
|
||||
<li><a href="./1_Introduction/1_Intro.html#mott-insulators">Mott Insulators</a></li>
|
||||
<li><a href="./1_Introduction/1_Intro.html#quantum-spin-liquids">Quantum Spin Liquids</a></li>
|
||||
</ul>
|
||||
|
Before Width: | Height: | Size: 823 KiB After Width: | Height: | Size: 812 KiB |
Before Width: | Height: | Size: 119 KiB After Width: | Height: | Size: 120 KiB |
Before Width: | Height: | Size: 41 KiB After Width: | Height: | Size: 41 KiB |
Before Width: | Height: | Size: 38 KiB After Width: | Height: | Size: 52 KiB |