The use of flavins and flavoproteins in photocatalytic, sensing, and biotechnological applications has led to a growing interest in computationally modeling the excited-state electronic structure and photophysics of flavin. However, there is limited consensus regarding which computational methods are appropriate for modeling flavin's photophysics. We compare the energies of low-lying excited states of flavin computed with time-dependent density functional theory (TD-DFT), equation-of-motion coupled cluster (EOM-EE-CCSD), scaled opposite-spin configuration interaction [SOS-CIS(D)], multiconfiguration pair-density functional theory (MC-PDFT), and several multireference perturbation theory (MR-PT2) methods. In the first part, we focus on excitation energies of the first singlet excited state (S1) of five different redox and protonation states of flavin, with the goal of finding a suitable active space for MR-PT2 calculations. In the second part, we construct two sets of one-dimensional potential energy surfaces connecting the S0 and S1 equilibrium geometries (S0-S1 path) and the S1 (π,π*) and S2 (n,π*) equilibrium geometries (S1-S2 path). The first path therefore follows a Franck-Condon active mode of flavin while the second path maps crossings points between low-lying singlet and triplet states in flavin. We discuss the similarities and differences in the TD-DFT, EOM-EE-CCSD, SOS-CIS(D), MC-PDFT and MR-PT2 energy profiles along these paths. We find that (TD-)DFT methods are suitable for applications such as simulating the spectra of flavins but are inconsistent with several other methods when used for some geometry optimizations and when describing the energetics of dark (n,π*) states. MR-PT2 methods show promise for the simulation of flavin's low-lying excited states, but the selection of orbitals for the active space and the number of roots used for state averaging must be done carefully to avoid artifacts. Some properties, such as the intersystem crossing geometry and energy between the S1 (π,π*) and T2 (n,π*) states, may require additional benchmarking before they can be determined quantitatively.