PowerPivot from Identical Excel Files

You can use the PowerPivot add-in for Excel 2010 to create a report from multiple Excel workbooks or worksheets, by joining the tables using the Primary and the Foreign key, such as ‘ProductID’ in a Sales table and a Pricing table.

In this example though, we want to combine the data in two Excel files that have an identical structure — sales data for the East and West regions. In this case, we can’t use a key to connect the tables; instead, we want to create one combined table from all the data.

The following technique allows you to import more than a million records from Excel, despite the fact that one worksheet can only contain up to 1,048,576 rows.

At least that’s possible in theory — on my computer it imported about 1.2 million, then gave up, after whining about memory resources.

Thanks to Excel MVP, Kirill Lapin, for sharing this very helpful tip with us. You can see more of Kirill’s work in last week’s posts on Combining Data from Two Excel Files in a Pivot Table.

Create a Connection in the Workbook

The key to this technique is to start by creating a workbook connection, before you launch PowerPivot.

  1. On the Excel Ribbon’s Data tab, click Connections.
  2. In the Workbook Connections window, click Add
  3. At the bottom of the Existing Connections window, click Browse for More.
  4. Navigate to the folder where your files are located.
  5. Select one of the files that you want to import — EastSales.xlsx in this example — and click Open.
  6. Select a table to import, and click OK.
  7. The new connection appears in the Workbook Connections window.

powerpivotunion05

Combine the Data in PowerPivot

  1. Close the Workbook Connections window, and on the Ribbon, click the PowerPivot tab.
  2. Click PowerPivot Window, to launch the PowerPivot add-in.
Combine the Data in PowerPivot
Combine the Data in PowerPivot

Note: If you’re using Windows XP, the PowerPivot window has a menu bar. If you’re using Vista or Windows 7, you’ll see a Ribbon instead.

  1. On the Table menu, click Existing Connections, or, on the Ribbon, click Design, then Existing Connections.
  2. At the bottom of the Existing Connections window, under Workbook Connections, click on the connection that you added, and click Open.
  3. In the Table Import Wizard, click Next, then select the table, and click Finish
  4. After the data is successfully imported, click Close.

powerpivotunion10

Change the SQL Statement

Now that the first table has been imported, you can change its properties, to combine it with data from the second table.

  1. On the Table menu, click Table Properties, or on the Ribbon, click the Design tab, then click Table Properties.
  2. At the right, from the Switch To drop down list, select Query Editor.
  3. Edit the SQL statement, to create a union query, combining the two tables. In this example, the SQL statement is:

SELECT [EastSales$].* FROM [EastSales$] UNION ALL SELECT * FROM ‘C:\_TESTWestSales.xlsx’.[WestSales$]

After you change the SQL statement, click the Validate button, to verify that the statement is correct, then click Save.

powerpivotunion13

Note: The SQL query string can also be edited in the Excel workbook connection window, by selecting the connection, and clicking Properties. However, there’s no Validate feature there.

Create the Pivot Table

Next, you can create a pivot table from the combined data.

  1. On the Toolbar, click the Create a PivotTable button, or on the Ribbon, click the Home tab, then click PivotTable.
  2. Select a location for the pivot table, and click OK.
  3. Add fields to the pivot table layout, to see a summary of the data.

Here’s the pivot table that was created from the combined data, with columns for the East and West regions. The Report Layout is Tabular, and Number format is used, with thousands separator and zero decimals.

powerpivotunion17

Detailed Instruction and Sample Files

To see detailed instructions for this technique, with more screen shots, visit the PowerPivot from Identical Structure Excel Files page on the Contextures website.

That page also has a link for downloading the East and West sales data that I used in this example.

Watch the PowerPivot Video

To see the steps for combining data from multiple tables in PowerPivot, you can watch this PowerPivot from Identical Excel Files video tutorial.

Download the PowerPivot Add-In

You can download the free PowerPivot add-in from the Microsoft website: PowerPivot Download (Note: This link is no longer available)
__________

0 thoughts on “PowerPivot from Identical Excel Files”

  1. for three or more excel sheets, you can use following formula that seems to be the only one to work:
    SELECT [Sheet1$].*
    FROM [Sheet1$]
    UNION ALL
    SELECT * FROM `C:\$link trials\2008 short example trial.xlsx`.[sheet2$]
    UNION ALL
    SELECT * FROM `C:\$link trials\2008 short example trial.xlsx`.[sheet3$]
    Now, this works with data that can fit on an excel sheet.
    Does anybody know how I can do this with extreme huge data which is to big to be put in an excelsheet of excel 2010?
    In combination of SQL?
    Best regards,
    Thierry

    1. I tried doing this with comma delmited text files but it won’t allow me to access the Switch to drop down list and select query editor.
      Was anyone able to address Thierry’s question relating to do this with huge data files greater than 1.1 million rows (e.g. comma delimited Text Files with >2million rows)?

  2. SELECT [‘Dist1$’].*
    FROM [‘Data$’]
    UNION ALL
    SELECT *FROM `C:\Master\Dist1.xlsx`.[Data$]
    The 4 lines above worked fine. But if I tried to add in another line, it no longer works. Does anyone know how can I make this work?
    UNION ALL
    SELECT *FROM `C:\Master\Dist2.xlsx`.[Data2$]
    When I try to validate, the following error pops up
    “The SQL statement is not valid. There are no columns detected in the statement.’

  3. SELECT [dbo].[v_carbonreportingv2].* FROM [dbo].[v_carbonreportingv2]
    UNION ALL
    SELECT * FROM `C:\TEST\BEImportfilCarbonReporting_V2.xlsx`.[‘Import to database$’]
    Incorrect syntax near ‘`’.
    I’m trying to combine an excel worksheet with an sql server table but only gets syntax error. When I try to combine to excel worksheet I don’t get the syntax error. Is it not possible to combine excel worksheet and a database table?
    I have the same columns in the worksheet as I have in the table, also same formatting…

  4. I was having issues with the name of the file, which started by producing error: “The SQL statement is not valid. There are no columns detected in the statement” for this statement:-
    SELECT [‘Prime ref prompt – Parties (1)$’].* FROM [‘Prime ref prompt – Parties (1)$’]
    UNION ALL
    SELECT * FROM ‘C:\CData\temp file.xlsx’.[data$]
    So I (eventually) deleted the first two lines leaving:-
    SELECT * FROM ‘C:\CData\temp file.xlsx’.[data$]
    then I git the following error: ”C:\CData\temp file.xlsx” is not a valid name. Make sure that it does not include invalid characters or punctuation and that it is not too long.
    What!?!? I thought. Then it occurred to me, It’s a SQL statement, use [] for containing names. This now works 🙂
    SELECT [‘Prime ref prompt – Parties (1)$’].* FROM [‘Prime ref prompt – Parties (1)$’]
    UNION ALL
    SELECT * FROM [C:\CData\temp file.xlsx].[data$]
    Also only use [‘ ‘] if the worksheet name contains spaces….. hope this helps someone else!

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.