A striking characteristic of human memory is that pictures are remembered better than words. We examined the neural correlates of memory for pictures and words in the context of episodic memory encoding to determine material-specific differences in brain activity patterns. To do this, we used positron emission tomography to map the brain regions active during encoding of words and pictures of objects. Encoding was carried out by using three different strategies to explore possible interactions between material specificity and types of processing. Encoding of pictures resulted in greater activity of bilateral visual and medial temporal cortices, compared with encoding words, whereas encoding of words was associated with increased activity in prefrontal and temporoparietal regions related to language function. Each encoding strategy was characterized by a distinctive activity pattern, but these patterns were largely the same for pictures and words. Thus, superior overall memory for pictures may be mediated by more effective and automatic engagement of areas important for visual memory, including medial temporal cortex, whereas the mechanisms underlying specific encoding strategies appear to operate similarly on pictures and words.