The assessment of Diffusion-Weighted MRI (DW-MRI) fibre-tracking algorithms has been limited by the lack of an appropriate 'gold standard'. Practical limitations of alternative methods and physical models have made numerical simulations the method of choice in practice. However, previous numerical phantoms have consisted of separate fibres embedded in homogeneous backgrounds, which do not capture the true nature of white matter. In this paper we describe a method that randomly generates numerical structures consisting of densely packed bundles of fibres, which are much more representative of human white matter, and simulates the DW-MR images that would arise from them under many imaging conditions. User-defined parameters may be adjusted to produce structures with a range of complexities that spans the levels we would expect to find in vivo. These structures are shown to contain many different features that occur in human white matter and could confound fibre-tracking algorithms, such as tract kissing and crossing. Furthermore, combinations of such features can be sampled by randomly generating many different structures with consistent levels of complexity. The proposed software provides a means of quantitative assessment via direct comparison between tracking results and the exact locations of the generated fibres. It should greatly improve our understanding of algorithm performance and therefore prove an important tool for fibre-tracking development.