Mixing Genres and Modes

Instagram and TikTok are popular forums for sharing images, videos, and other types of content. Think about your purpose and envision how these media can communicate your message. Read this text on how to choose the appropriate text and visual images. How do these guidelines apply to other forms of social media?

The writing genre for this chapter incorporates a variety of modalities. A genre is a type of composition that encompasses defined features, follows a style or format, and reflects your purpose as a writer. For example, given the composition types romantic comedy, poetry, or documentary, you probably can think easily of features of each of these composition types. When considering multimodal genres, you will discover that genres create conventions (standard ways of doing things) for categorizing media according to the expectations of the audience and the way the media will be consumed. Consider film media, for example; it encompasses genres including drama, documentaries, and animated shorts, to name a few. Each genre has its own conventions or features. When you write or analyze multimodal texts, it is important to account for genre conventions.

A note on the text: typically, when referring to text, people mean written words. But in multimodal genres, the term text can refer to a piece of communication as a whole, incorporating written words, images, sounds, and even movement. The following images are examples of multimodal texts.

A laptop and a mobile phone appear on a table. The laptop display shows a webpage with a combination of words and images.


Figure 18.3 A website is a multimodal text that combines images and words. It may incorporate sound and animation as well.

Four dancers perform a traditional Okinawan dance.

Figure 18.4 A theatrical performance, as shown in this dance image, is a multimodal text combining drama, music, movement, and visual design. Other theatrical performances combine different elements.

An infographic about research contains a mixture of images, text, and white space. It contains images of a person, ID card, l

Figure 18.5 An infographic is a multimodal text that combines words and different types of graphics to convey information succinctly.


Multimodal genres are uniquely positioned to address audiences through a variety of modes or types of communication. These can be identified in the following categories:

  • Linguistic text: The most common mode for writing, the linguistic mode includes written or spoken text.

  • Visuals: The visual mode includes anything the reader can see, including images, colors, lighting, typefaces, lines, shapes, and backgrounds.

  • Audio: The audio mode includes all types of sound, such as narration, sound effects, music, silence, and ambient noise.

  • Spatial: Especially important in digital media, the spatial mode includes spacing, image and text size and position, white space, visual organization, and alignment.

  • Gestural: The gestural mode includes communication through all kinds of body language, including movement and facial expressions.


The multimodal composition provides an opportunity for you to develop and practice skills that will translate to future coursework and career opportunities. Creating a multimodal text requires you to demonstrate aptitude in various modes and reflects the requirements for communication skills beyond the academic world. In other words, although multimodal creations may seem to be little more than pictures and captions at times, they must be carefully constructed to be effective. Even the simplest compositions are meticulously planned and executed. Multimodal compositions may include written text, such as blog post text, slideshow text, and website content; image-based content, such as infographics and photo essays; or audiovisual content, including podcasts, public service announcements, and videos.

A soldier scans the ridge through binoculars.

Figure 18.6 Multimodal compositions aren't always complicated. Even a single photo can create a major impact on readers.


Multimodal composition is especially important in a 21st-century world where communication must represent and transfer across cultural contexts. Because using multiple modes helps a writer make meaning in different channels (media that communicate a message), the availability of different modes is especially important to help you make yourself understood as an author. In academic settings, multimodal content creation increases engagement, improves equity, and helps prepare you to be a global citizen. The same is true for your readers. Multimodal composition is important in addressing and supporting cultural and linguistic diversity. Modes are shaped by social, cultural, and historic factors, all of which influence their use and impact on communication. And it isn't just readers who benefit from multimodal composition. Combining a variety of modes allows you as a composer to connect to your own lived experiences – the representation of experiences and choices that you have faced in your own life – and helps you develop a unique voice, thus leveraging your knowledge and experiences.


Source: Michelle Bachelor Robinson, Maria Jerskey, and Toby Fulwiler, https://openstax.org/books/writing-guide/pages/18-1-mixing-genres-and-modes
Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 License.

Last modified: Saturday, July 15, 2023, 1:46 PM