5.13. Exploratory Data Analysis with Matplotlib and Seaborn: Save visualizations to files (PNG, JPG, etc.)

Exploratory data analysis (EDA) is a fundamental step in the machine learning and deep learning process, as it allows you to better understand the characteristics, patterns and relationships present in the data. Python, being one of the most popular languages ​​for data science, offers powerful libraries like Matplotlib and Seaborn for data visualization. A crucial part of EDA is the ability to save created visualizations in files such as PNG, JPG, and other formats, so that they can be used in reports, presentations, or simply archived for future reference.

Matplotlib: An Introduction

Matplotlib is a plotting library for the Python programming language and its numerical extension package NumPy. It provides an object-oriented interface for embedding graphics in applications using GUI toolkits such as Tkinter, wxPython, Qt or GTK. Additionally, Matplotlib can be used in Python scripts, Python and IPython shells, the Jupyter notebook, web application servers, and four GUI toolkits.

Seaborn: An Introduction

Seaborn is a Python data visualization library based on Matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphs. Seaborn is particularly suited to exploring and understanding data through high-quality graphs. It works well with pandas DataFrames and assumes the data is in a clean format suitable for visualizations.

Saving Visualizations with Matplotlib

To save graphs with Matplotlib, you can use the savefig() method of the Figure object. This method is very flexible and allows you to specify various parameters to control the output, such as resolution (DPI), size, file format, and more. Here is a basic example of how to save a chart:


import matplotlib.pyplot as plt

# Creating a simple graph
plt.plot([1, 2, 3], [4, 5, 6])
plt.title('Chart Example')

# Saving the chart to a PNG file
plt.savefig('my_graphic.png')

# You can also specify the DPI and size of the graphic
plt.savefig('meu_grafico_alta_resolucao.png', dpi=300)
plt.savefig('my_graphic_size_specific.png', figsize=(10, 8))

In addition to PNG, you can save graphics in other formats such as JPG, SVG, PDF, among others, simply by changing the file extension in the savefig() method.

Saving Views with Seaborn

Because Seaborn is built on top of Matplotlib, the process of saving plots is very similar. However, Seaborn adds some features and styles that can be very useful. Here is an example of how to save a Seaborn chart:


import seaborn as sns
import matplotlib.pyplot as plt

# Loading an example dataset
tips = sns.load_dataset('tips')

# Creating a bar chart with Seaborn
sns.barplot(x='day', y='total_bill', data=tips)

# Saving the graph created with Seaborn
plt.savefig('grafico_seaborn.png')

It's important to note that when using Seaborn, you are still working with Matplotlib objects. Therefore, all functionality of savefig() is still available.

Advanced Settings When Saving Charts

When you are saving graphics to be included in publications or presentations, you may need to configure additional details such as transparency, quality, and margins. Here are some tips:

  • Transparency: To save a chart with a transparent background, use the transparent=True argument in the savefig() method.
  • Quality: For raster graphics (such as PNG or JPG), the dpi argument controls the quality of the image. A higher value results in a sharper and larger image.
  • Margins: Sometimes charts can be saved with unwanted margins. You can use plt.tight_layout() before saving the figure to optimize space usage.

# Saving with transparent background and optimized margins
plt.savefig('grafico_transparente.png', transparent=True, bbox_inches='tight')

In conclusion, both Matplotlib and Seaborn offer robust tools for creating and saving data visualizations. The ability to save graphs efficiently and with high quality is essential for communicating the results of your exploratory data analysis. With practice, you'll be able to create visualizations that not only reveal valuable insights into your data but also stand out in terms of clarity and visual impact.

Now answer the exercise about the content:

Which of the following methods is used to save data visualizations created with Matplotlib and Seaborn to files such as PNG, JPG, and others?

You are right! Congratulations, now go to the next page

You missed! Try again.

Article image Exploratory Data Analysis with Matplotlib and Seaborn: Interpretation and conclusions from visualizations

Next page of the Free Ebook:

19Exploratory Data Analysis with Matplotlib and Seaborn: Interpretation and conclusions from visualizations

4 minutes

Obtenez votre certificat pour ce cours gratuitement ! en téléchargeant lapplication Cursa et en lisant lebook qui sy trouve. Disponible sur Google Play ou App Store !

Get it on Google Play Get it on App Store

+ 6.5 million
students

Free and Valid
Certificate with QR Code

48 thousand free
exercises

4.8/5 rating in
app stores

Free courses in
video, audio and text