Allow ExcelWriter() to add sheets to existing workbook #3441

ligon · 2013-04-23T22:22:18Z

The ability of ExcelWriter to save different dataframes to different worksheets is great for sharing those dfs with the python-deficient. But this quickly leads to a need to add worksheets to an existing workbook, not just creating one from scratch; something like:

df0=pd.DataFrame(np.arange(3))
df0.to_excel('foo.xlsx','Data 0')

df1=pd.DataFrame(np.arange(2))
df1.to_excel('foo.xlsx','Data 1')

The following little diff to io/parsers.py implements this behavior for *.xlsx files:

diff --git a/pandas/io/parsers.py b/pandas/io/parsers.py
index 89f892d..7f010ee 100644
--- a/pandas/io/parsers.py
+++ b/pandas/io/parsers.py
@@ -2099,12 +2099,19 @@ class ExcelWriter(object):
             self.fm_date = xlwt.easyxf(num_format_str='YYYY-MM-DD')
         else:
             from openpyxl.workbook import Workbook
-            self.book = Workbook()  # optimized_write=True)
-            # open pyxl 1.6.1 adds a dummy sheet remove it
-            if self.book.worksheets:
-                self.book.remove_sheet(self.book.worksheets[0])
+            from openpyxl.reader.excel import load_workbook
+
+            try:
+               self.book=load_workbook(filename = path)
+               self.sheets={wks.title:wks for wks in self.book.worksheets}
+            except InvalidFileException:
+                self.book = Workbook()  # optimized_write=True)
+                # open pyxl 1.6.1 adds a dummy sheet remove it
+                if self.book.worksheets:
+                    self.book.remove_sheet(self.book.worksheets[0])
+                self.sheets = {}
+
         self.path = path
-        self.sheets = {}
         self.cur_sheet = None

Doing this for *.xls files is a little harder.


jreback · 2013-09-22T00:40:07Z

@jtratner is this still a bug/needed enhancement?

jtratner · 2013-09-22T01:08:25Z

Because of how to_excel is set up, this would mean reading in and then writing the file each time (because to_excel with a path argument saves the file). The right way to do this is to use ExcelWriter:

import pandas as pd
writer = pd.ExcelWriter('foo.xlsx')
df.to_excel(writer, 'Data 0')
df.to_excel(writer, 'Data 1')
writer.save()

I could see (eventually) adding an option to ExcelWriter that doesn't overwrite the file. But, yet again, that may mean writing in the entire file first. I don't know.

jtratner · 2013-09-22T01:09:26Z

I'm going to add something to the docs about this, maybe a test case with this, and I'll look into adding an option to read in the file, but it depends on how xlwt and openpyxl work.

jreback · 2013-09-22T01:13:29Z

@jtratner what about a context manager get_excel?

with get_excel('foo.xlsx') as e:
    df.to_excel(e, 'Data 0')
    df.to_excel(e, 'Data 1')

?

jtratner · 2013-09-22T01:14:31Z

how about we just make ExcelWriter into a contextmanager instead? it'll just call save at the end. Much simpler.

jtratner · 2013-09-23T04:16:20Z

@ligon you can do this now this way:

with ExcelWriter('foo.xlsx') as writer:
    df.to_excel(writer, 'Data 0')
    df2.to_excel(writer, 'Data 1')

If you don't use the with statement, you just have to call save() at the end.

ligon · 2013-09-23T17:06:12Z

Excellent. And great that it has an exit method.

Thanks,
-Ethan Ligon

Ethan Ligon, Associate Professor
Agricultural & Resource Economics
University of California, Berkeley

dylancis · 2014-01-10T11:36:06Z

I was extremely interested in the request made by @ligon, but it seems this is already there.
However, using pandas 0.12.0, when I do:
df = DataFrame([1,2,3])
df2 = DataFrame([5,5,6])
with ExcelWriter('foo.xlsx') as writer:
    df.to_excel(writer, 'Data 0')
    df2.to_excel(writer, 'Data 1')

Assuming foo.xlsx contained a sheet named 'bar', 'bar' gets deleted after the command runs. Going by your comment, I was expecting it to be kept in my foo.xlsx file. Is that a bug?

frenet · 2014-04-04T01:46:33Z

Is it hard to add sheets to an existing Excel file on disk?
import pandas as pd
import numpy as np
a=pd.DataFrame(np.random.random((3,1)))
excel_writer=pd.ExcelWriter(r'c:\excel.xlsx')
a.to_excel(excel_writer, 'a1')
excel_writer.save()

excel_writer=pd.ExcelWriter(r'c:\excel.xlsx')
a.to_excel(excel_writer, 'a2')
excel_writer.save()

Here only sheet 'a2' is saved, but I would like to save both 'a1' and 'a2'.

I know it is possible to add sheets to an existing workbook.

jtratner · 2014-04-04T01:48:40Z

It's definitely possible to add sheets to an existing workbook, but it's
not necessarily easy to do it with pandas. I think you'd have to read the
workbook separately and then pass it into the ExcelWriter class... That
would be something we could consider supporting.

jtratner · 2014-04-04T01:50:46Z

And I think if you subclass the ExcelWriter class you want to use and
override its __init__ method, as long as you set self.book it should work.
That said, there is no guarantee that this will continue to work in future
versions, since it's only a quasi-public API.

ankostis · 2015-12-17T23:12:51Z

This Stack Overflow workaround, which is based on openpyxl, ~~may work~~
(EDIT: indeed works, checked with pandas-0.17.0):

import pandas
from openpyxl import load_workbook

book = load_workbook('Masterfile.xlsx')
writer = pandas.ExcelWriter('Masterfile.xlsx', engine='openpyxl') 
writer.book = book
writer.sheets = dict((ws.title, ws) for ws in book.worksheets)

data_filtered.to_excel(writer, "Main", cols=['Diff1', 'Diff2'])

writer.save()

jreback · 2015-12-18T14:57:32Z

This would be pretty easy to implement inside ExcelWriter (or use the patch above).

We probably just need to add a mode= keyword that defaults to 'w', with 'a' meaning append.
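
For readers arriving later: a minimal sketch of how such a keyword looks in use, assuming pandas 0.24 or later (the milestone under which append mode for the openpyxl engine was merged in #21251) and an installed openpyxl. The temporary path and the sheet contents are illustrative only.

```python
import os
import tempfile

import pandas as pd

# Illustrative location; any .xlsx path works.
path = os.path.join(tempfile.mkdtemp(), "foo.xlsx")

# First write: the default mode="w" creates (or overwrites) the workbook.
with pd.ExcelWriter(path, engine="openpyxl") as writer:
    pd.DataFrame({"a": [0, 1, 2]}).to_excel(writer, sheet_name="Data 0")

# Second write: mode="a" loads the existing workbook and adds a sheet to it.
with pd.ExcelWriter(path, engine="openpyxl", mode="a") as writer:
    pd.DataFrame({"b": [0, 1]}).to_excel(writer, sheet_name="Data 1")

# Read the sheet names back to confirm that both sheets are present.
with pd.ExcelFile(path) as xls:
    sheet_names = xls.sheet_names
print(sheet_names)
```

The context manager saves the workbook on exit, so no explicit save() call is needed.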

pylang · 2016-05-11T22:00:53Z

Was this ever patched?

jreback · 2016-05-11T22:04:21Z

It seems that you can work around it (see above), but I suppose it would be nice to actually do it from pandas.

andreacassioli · 2016-11-21T17:35:23Z

Hi, any follow up on this issue?
I can provide a use case: I have Excel files with pivot tables and pivot charts that I need to share with people who are not proficient in Python.

My idea was to use pandas to add a sheet that contains the data for the pivot. But up to now I am stuck, and the proposed workaround, though not difficult, seems a bit cumbersome. It would make sense to just have an option controlling whether to overwrite an existing file.

zeluspudding · 2016-12-11T21:28:10Z

Let me echo @jreback , it would be super nice if I could just add a sheet to an excel workbook from pandas.

jorisvandenbossche · 2016-12-11T21:52:00Z

To be clear, we agree that this would be a nice functionality, and would certainly welcome a contribution from the community.
Best way to have this in pandas is to make a PR!

aa3222119 · 2017-02-16T10:20:02Z

@jmcnamara how can I use DataFrame.to_excel (perhaps with pandas.ExcelWriter) to add some data to an existing file without rewriting it?

jorisvandenbossche · 2017-02-16T10:27:09Z

@aa3222119 That is exactly what this issue is about: an enhancement request to add this ability, which is not yet possible today.

(BTW, it is not needed to post the same question in multiple issues)

aa3222119 · 2017-02-16T10:40:46Z

Sorry, I will delete that one. BTW, will that be possible some day? @jorisvandenbossche

jmcnamara · 2017-02-16T10:41:41Z

> BTW, will that be possible some day? @jmcnamara

This isn't and won't be possible when using XlsxWriter as the engine. It should be possible when using OpenPyXL. I've seen some examples on SO like this one: jmcnamara/excel-writer-xlsx#157

aa3222119 · 2017-02-16T11:00:22Z

Thank you very much! @jmcnamara
It is exactly as you said: use openpyxl 👍

import pandas as pd
from openpyxl import load_workbook
book = load_workbook('text.xlsx')
writer = pd.ExcelWriter('text.xlsx', engine='openpyxl')
writer.book = book
pd.DataFrame(userProfile,index=[1]).to_excel(writer,'sheet111',startrow=7,startcol=7)
pd.DataFrame(userProfile,index=[1]).to_excel(writer,'sheet123',startrow=0,startcol=0)
writer.save()
pd.DataFrame(userProfile,index=[1]).to_excel(writer,'sheet123',startrow=3,startcol=3)
writer.save()

all can be added to text.xlsx.
https://github.com/pandas-dev/pandas/issues/3441

Themanwithoutaplan · 2017-03-04T16:21:43Z

openpyxl allows you to put DataFrames wherever you want them
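
A minimal sketch of that approach, assuming openpyxl's documented dataframe_to_rows helper; the file name, sheet names, and the C5 start cell are illustrative, and the first few lines only create a stand-in for a workbook that already exists on disk.

```python
import os
import tempfile

import pandas as pd
from openpyxl import Workbook, load_workbook
from openpyxl.utils.dataframe import dataframe_to_rows

path = os.path.join(tempfile.mkdtemp(), "Masterfile.xlsx")

# Stand-in for a workbook that already exists on disk.
wb = Workbook()
wb.active.title = "Main"
wb.save(path)

df = pd.DataFrame({"Diff1": [1, 2], "Diff2": [3, 4]})

# Reopen the workbook and write the frame into a new sheet, starting at cell C5.
wb = load_workbook(path)
ws = wb.create_sheet("Extra")
start_row, start_col = 5, 3  # openpyxl rows and columns are 1-based; column 3 is "C"
for r, row in enumerate(dataframe_to_rows(df, index=False, header=True)):
    for c, value in enumerate(row):
        ws.cell(row=start_row + r, column=start_col + c, value=value)
wb.save(path)
```

Because the workbook is loaded and saved by openpyxl itself, the existing "Main" sheet is kept, and pandas is only used to hold the data.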

jgonzale · 2017-03-09T12:04:19Z

@ankostis, @aa3222119 when I follow the steps you describe, I always hit the following error:

Traceback (most recent call last):
  File "./name_manipulation.py", line 60, in <module>
    df.to_excel(excel_writer, 'iadatasheet', startcol=0, startrow=5, columns=['codes', 'Zona Basica de Salud', month+"-"+year], index=False)
  File "/Users/jgonzalez.iacs/Projects/SIIDI/PYTHON_ETLs/venv/lib/python3.4/site-packages/pandas/core/frame.py", line 1464, in to_excel
    startrow=startrow, startcol=startcol)
  File "/Users/jgonzalez.iacs/Projects/SIIDI/PYTHON_ETLs/venv/lib/python3.4/site-packages/pandas/io/excel.py", line 1306, in write_cells
    wks = self.book.create_sheet()
AttributeError: 'str' object has no attribute 'create_sheet'

So, there is no solution yet, right?

Thanks

ankostis · 2017-03-09T12:38:27Z

Maybe the API has changed - it definitely worked back then.

Themanwithoutaplan · 2017-03-09T16:46:57Z

@jgonzale which engine are you using?

aa3222119 · 2017-03-10T02:00:47Z

@jgonzale going by what Python says, maybe your excel_writer.book is just a str and not a workbook?

jgonzale · 2017-03-10T07:26:10Z

@aa3222119 Oh geez! You were right! Messing around with very similar names!

Thank you very much! 👏 👏 👏

wxl3322335 · 2017-04-03T17:55:34Z

thank you very much!

BLMeltdown · 2017-06-12T12:21:41Z

Hello
I have a use case where this would be useful.
Even with the ExcelWriter approach:

with ExcelWriter('foo.xlsx') as writer:
    df.to_excel(writer, 'Data 0')
    df2.to_excel(writer, 'Data 1')

you can't add a plot without saving the file and reopening it, at the risk of disturbing any formatting already in the workbook.

There is indeed the workaround of using the plotting functions from pandas to save these in the files, but when you need something a little more sophisticated, like showing a PCA components graph built with scikit-learn's PCA and matplotlib, it becomes tedious.

Hence
a pandas.nondf_manager (non df object or filename).to_excel(usual syntax)
would be exceedingly fine.
Thanks.

orbitalz · 2017-07-24T13:10:37Z

I don't know how it is possible, however, it works for me

create_excel = 0
if plot_spectra != 0:
    for x in range(min_sigma, max_sigma, step_size):
        # apply gaussian
        df1 = gaussian_filter(df, sigma=x, mode=padding_mode)
        df2 = pd.DataFrame(df1)
        if save_file:
            if save_csv:
                df2.to_csv('{} {}{}.csv'.format(Output_file, 'sigma_', x,))
            if save_xlsx:
                if os.path.isfile('{}.xlsx'.format(Output_file)):
                    print("Warning! Excel file is exist")
                    break
                if create_excel == 0:
                    xlsx_writer = pd.ExcelWriter('{}.xlsx'.format(Output_file), engine='xlsxwriter')                                          
                    create_excel += 1
                df2.to_excel(xlsx_writer, '{}{}'.format('sigma_', x))
                if x == max_sigma-1:
                    xlsx_writer.save()

At the end, I got an Excel file that has several worksheets.

jorisvandenbossche · 2017-07-27T08:35:43Z

@orbitalz you are creating an excel file the first time (xlsx_writer = pd.ExcelWriter(..)), and then adding multiple sheets to that file object. That is supported, but this issue is about adding sheets to an existing excel file.

orbitalz · 2017-08-01T02:44:38Z

I'm sorry for misunderstanding the topic, and thank you for pointing that out :)

tlysecust · 2018-05-23T09:21:00Z

@orbitalz You solved my problem, but I don't know how it works.

ivoska · 2019-09-23T12:37:09Z

mode={'a'} does not work as the documentation suggests
this is still a buggy mess

codewithpatch · 2019-11-04T03:40:12Z

Appending to an existing workbook seems to work with
writer = pd.ExcelWriter('filename.xlsx', mode='a')

But this only appends; it does not overwrite sheets that have the same sheet name.

For example, my existing workbook has a sheet named 'mySheet'.
If I try to do:
df.to_excel(writer, 'mySheet')
it will create a new sheet 'mySheet1' instead of rewriting the existing 'mySheet'.

I wonder if there's any other way to append in the existing workbook, but overwriting sheets that you want to overwrite.

Hope someone helps.
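
One possible answer, sketched under the assumption of a later pandas release than the one discussed here: pandas 1.3 added an if_sheet_exists keyword to ExcelWriter (not mentioned elsewhere in this thread) that, in append mode with the openpyxl engine, can replace a sheet of the same name. The path and data below are illustrative.

```python
import os
import tempfile

import pandas as pd

path = os.path.join(tempfile.mkdtemp(), "filename.xlsx")

# Stand-in for the existing workbook: two sheets, one of them named "mySheet".
with pd.ExcelWriter(path, engine="openpyxl") as writer:
    pd.DataFrame({"x": [1, 2, 3]}).to_excel(writer, sheet_name="mySheet", index=False)
    pd.DataFrame({"y": [9]}).to_excel(writer, sheet_name="other", index=False)

# Assumes pandas >= 1.3: replace the existing sheet instead of adding "mySheet1".
with pd.ExcelWriter(
    path, engine="openpyxl", mode="a", if_sheet_exists="replace"
) as writer:
    pd.DataFrame({"x": [10, 20]}).to_excel(writer, sheet_name="mySheet", index=False)

# Read the result back: "other" is untouched, "mySheet" holds the new data.
with pd.ExcelFile(path) as xls:
    sheet_names = xls.sheet_names
    replaced = xls.parse("mySheet")
```

Passing if_sheet_exists="new" instead keeps the old sheet and writes to a renamed one, which matches the 'mySheet1' behaviour described above.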

anvesha-nextsteps · 2019-11-28T14:23:48Z

By using openpyxl as engine in ExcelWriter
writer = pd.ExcelWriter(filename, engine='openpyxl')
df.to_excel(writer, sheet_name)
at writer.save() i am getting this error
TypeError: got invalid input value of type <class 'xml.etree.ElementTree.Element'>, expected string or Element

irishun · 2020-03-15T23:44:56Z

> By using openpyxl as engine in ExcelWriter
> writer = pd.ExcelWriter(filename, engine='openpyxl')
> df.to_excel(writer, sheet_name)
> at writer.save() i am getting this error
> TypeError: got invalid input value of type <class 'xml.etree.ElementTree.Element'>, expected string or Element

I have met the same error. Has anyone solved this issue?

LittleMoDel · 2020-03-20T16:01:33Z

The engine should be changed to openpyxl, because the default engine 'xlsxwriter' does NOT support append mode!

import pandas as pd

df = pd.DataFrame({'lkey': ['foo', 'bar', 'baz', 'foo'], 'value': [1, 2, 3, 5]})

# The engine must be openpyxl, because the default engine 'xlsxwriter' does NOT support append mode.
writer = pd.ExcelWriter('exist.xlsx', mode='a', engine='openpyxl')
df.to_excel(writer, sheet_name='NewSheet')
writer.save()
writer.close()

Pandas chooses an Excel writer via two methods:

1. the engine keyword argument
2. the filename extension (via the default specified in config options)

By default, pandas uses XlsxWriter for .xlsx, openpyxl for .xlsm, and xlwt for .xls files. If you have multiple engines installed, you can set the default engine through setting the config options io.excel.xlsx.writer and io.excel.xls.writer. pandas will fall back on openpyxl for .xlsx files if XlsxWriter is not available.

To specify which writer you want to use, you can pass an engine keyword argument to to_excel and to ExcelWriter. The built-in engines are:

- openpyxl: version 2.4 or higher is required
- xlsxwriter
- xlwt
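
The two selection methods can be sketched as follows, assuming openpyxl is installed; the output directory and file names are illustrative, and engine_used is just a local variable for inspecting the result.

```python
import os
import tempfile

import pandas as pd

out_dir = tempfile.mkdtemp()
df = pd.DataFrame({"lkey": ["foo", "bar"], "value": [1, 2]})

# Method 1: pass the engine keyword explicitly (works on to_excel and ExcelWriter).
explicit_path = os.path.join(out_dir, "a.xlsx")
df.to_excel(explicit_path, engine="openpyxl")

# Method 2: let the .xlsx extension pick the engine from the config option.
option_path = os.path.join(out_dir, "b.xlsx")
with pd.option_context("io.excel.xlsx.writer", "openpyxl"):
    with pd.ExcelWriter(option_path) as writer:
        df.to_excel(writer, sheet_name="Sheet1")
        engine_used = writer.engine  # name of the engine pandas selected
print(engine_used)
```

Since append mode depends on the engine, passing engine='openpyxl' explicitly avoids relying on whichever default happens to be configured.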

macifTest · 2022-01-31T20:40:35Z

Hello,
I have an issue with the use of Pandas + ExcelWriter + load_workbook.
My need is to be able to modify data from an existing excel file (without deleting the rest).
It partly works, but the size of the produced file is quite different from that of the original.
Moreover, it seems to lack some properties, which leads to an error message when I try to import the modified file into an application.
The code is below:

data_filtered = pd.DataFrame([date, date, date, date], index=[2,3,4,5])
book = openpyxl.load_workbook(file_origin)
writer = pd.ExcelWriter(file_modif, engine='openpyxl',datetime_format='dd/mm/yyyy hh:mm:ss', date_format='dd/mm/yyyy')
writer.book = book
## ExcelWriter for some reason uses writer.sheets to access the sheet.
## If you leave it empty it will not know that sheet Main is already there
## and will create a new sheet.
writer.sheets = dict((ws.title, ws) for ws in book.worksheets)
data_filtered.to_excel(writer, sheet_name="PCA pour intégration", index=False, startrow=2, startcol=5, header=False, verbose=True)
writer.save()

Thanks

ligon closed this as completed Apr 23, 2013

ligon reopened this Apr 23, 2013

ghost assigned jtratner Sep 22, 2013

jtratner mentioned this issue Sep 22, 2013

ENH: Make ExcelWriter & ExcelFile contextmanagers #4933

Merged

jtratner closed this as completed in #4933 Sep 23, 2013

wesm unassigned jtratner Oct 12, 2016

jorisvandenbossche modified the milestones: Next Major Release, 0.13, Someday Dec 11, 2016

jorisvandenbossche reopened this Dec 11, 2016

WillAyd mentioned this issue May 29, 2018

Append Mode for ExcelWriter with openpyxl #21251

Merged

4 tasks

jreback modified the milestones: Someday, 0.24.0 Jun 19, 2018

jreback closed this as completed in #21251 Jun 19, 2018

roberthdevries mentioned this issue Mar 8, 2020

pd.ExcelFile closes stream on destruction in pandas 1.0.0 #31467

Closed

WillAyd mentioned this issue Apr 3, 2020

Append Mode for ExcelWriter with "xlsxwriter" #33264

Closed
