Replace iterrows() with itertuples() for better performance

https://github.com/bluehands/howto-optimize-co2-footprint-cloud-application-in-azure/blob/e7d5df73029b5523a7faa0daadadff32639ab85a/src/Auswertung/main.py#L32
Current code:
```
for index, row in x.iterrows():
    ...

```
Recommended replacement:
```
for row in x.itertuples(index=True):
    ...

```
iterrows() returns each row as a pandas Series, which adds overhead from object creation, dtype coercion, and label-based access. Each iteration allocates a new Series and resolves columns by label, and because a Series has a single dtype, rows that mix column types are upcast to a common dtype. On large DataFrames this can become a major performance bottleneck.
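The dtype side of this overhead can be seen in a small sketch. The DataFrame and its column names (`n_items`, `weight`) are made up for illustration and are not taken from main.py:

```python
import pandas as pd

# Hypothetical data: one integer column and one float column.
df = pd.DataFrame({"n_items": [1, 2], "weight": [1.5, 2.5]})

# iterrows() packs each row into a single Series, so the integer value
# is upcast to float to share one dtype with the float column.
_, first_row = next(df.iterrows())
row_dtype = first_row.dtype
value_from_series = first_row["n_items"]

# itertuples() keeps each column's values as they are stored in the column.
first_tuple = next(df.itertuples(index=False))
value_from_tuple = first_tuple.n_items

print(df["n_items"].dtype, row_dtype)
print(type(value_from_series).__name__, type(value_from_tuple).__name__)
```

The column itself keeps its integer dtype; only the per-row Series view is coerced.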

itertuples(), by contrast, yields each row as a lightweight namedtuple. This avoids the per-row Series overhead and gives fast, attribute-style access to columns (`row.column_name`, with the index label available as `row.Index` when `index=True`). It is generally much faster and more memory-efficient than iterrows(), making it the preferred method for row-wise access when the rows do not need to be mutated. The loop body has to be updated to match: `row["col"]` lookups become `row.col`.
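A minimal sketch of the migration follows. The sample DataFrame and the column names `region` and `emissions` are assumptions for illustration only; the real columns used in main.py are not shown in this issue:

```python
import pandas as pd

# Hypothetical sample data standing in for the DataFrame `x` in main.py.
x = pd.DataFrame(
    {"region": ["north", "south", "west"], "emissions": [1.5, 2.0, 0.5]},
    index=[10, 20, 30],
)

# Before: iterrows() yields (index, Series); columns are read by label.
total_iterrows = 0.0
labels_iterrows = []
for index, row in x.iterrows():
    total_iterrows += row["emissions"]
    labels_iterrows.append((index, row["region"]))

# After: itertuples() yields a namedtuple; columns are read as attributes
# and the index label is exposed as row.Index.
total_itertuples = 0.0
labels_itertuples = []
for row in x.itertuples(index=True):
    total_itertuples += row.emissions
    labels_itertuples.append((row.Index, row.region))

# Both loops produce the same results.
assert total_iterrows == total_itertuples
assert labels_iterrows == labels_itertuples
```

One caveat when migrating: column names that are not valid Python identifiers (for example names containing spaces) are replaced by positional names in the namedtuple, so such columns may need to be renamed first or read by position.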
