Background
It is a very common requirement in SSIS packages to extract contents of a compressed ZIP file, and then process that file in a data flow. From what I can see, it appears that TaskUnzip is quite a popular way to achieve this. So as part of the BI Aviation project I decided to use this for extracting the data from the ZIP files available in those datasets.
Using TaskUnzip
In my testing I used build 1.3.0.1 available for SQL 2008 (SSIS_TaskUnZip_1.3.0.1_SQL_2008.zip). I suspected it may not work in SQL 2012 as-is, but decided to give it a try. The setup instructions are included in the ZIP file, so I ran the Install_SQL_2008.cmd to install the library. That failed, because the references are to SQL 2008. So here is the corrected installation script:
net stop "SQL Server Integration Services 11.0"
gacutil.exe /u "ICSharpCode.SharpZipLib"
gacutil.exe /u TaskUnZip
copy "ICSharpCode.SharpZipLib.dll" "%ProgramFiles%\Microsoft SQL Server\110\DTS\Tasks\" /Y
copy "TaskUnZip.*" "%ProgramFiles%\Microsoft SQL Server\110\DTS\Tasks\" /Y
gacutil.exe /i "%ProgramFiles%\Microsoft SQL Server\110\DTS\Tasks\ICSharpCode.SharpZipLib.dll"
gacutil.exe /i "%ProgramFiles%\Microsoft SQL Server\110\DTS\Tasks\TaskUnZip.dll"
net start "SQL Server Integration Services 11.0"
pause
The fixes are to change the references of 10.0 to 11.0 and 100 to 110, to suit the SQL 2012 version. You have to do similar corrections in the x86 version of the CMD script, because designers like the VS2010 based SSIS package editor are still 32-bit:
net stop "SQL Server Integration Services 11.0"
gacutil.exe /u "ICSharpCode.SharpZipLib"
gacutil.exe /u TaskUnZip
copy "ICSharpCode.SharpZipLib.dll" "%ProgramFiles(x86)%\Microsoft SQL Server\110\DTS\Tasks\" /Y
copy "TaskUnZip.*" "%ProgramFiles(x86)%\Microsoft SQL Server\110\DTS\Tasks\" /Y
gacutil.exe /i "%ProgramFiles(x86)%\Microsoft SQL Server\110\DTS\Tasks\ICSharpCode.SharpZipLib.dll"
gacutil.exe /i "%ProgramFiles(x86)%\Microsoft SQL Server\110\DTS\Tasks\TaskUnZip.dll"
net start "SQL Server Integration Services 11.0"
pause
At this stage, you would expect the TaskUnzip task should show up in the SSIS Toolbox, but if you have a fresh installation of SQL 2012, it will not show up!
Rebuilding TaskUnzip for SQL 2012
Start by downloading the sources (I used this link) and opening it in Visual Studio 2010 SP1. When you rebuild this project, you get the following errors:
The type or namespace name 'Dts' does not exist in the namespace 'Microsoft.SqlServer' (are you missing an assembly reference?)
The type or namespace name 'Task' could not be found (are you missing a using directive or an assembly reference?)
(errors truncated for brevity). This is due to the namespace changes in DTS in SQL 2012. You will also find that the references to the older DLLs are broken:
We also need to change the .NET Framework version to 4.0 runtime (read more about this here):
Then remove the old references and add Microsoft.SqlServer.Dts.Design and Microsoft.SQLServer.ManagedDTS (though the namespace names are the same, the DLLs they reference are different, hence you need to add them again.)
After this is done, the DLL build should succeed.
Deploying the recompiled DLL
After this is done, if you uninstall and reinstall the newer DLL you might get the following error:
C:\2008>gacutil.exe /i "C:\Program Files\Microsoft SQL Server\110\DTS\Tasks\TaskUnZip.dll"
Microsoft (R) .NET Global Assembly Cache Utility. Version 3.5.21022.8
Copyright (c) Microsoft Corporation. All rights reserved.Failure adding assembly to the cache: This assembly is built by a runtime newer than the currently loaded runtime and cannot be loaded.
This is because the gacutil.exe which ships with the TaskUnzip library ZIP file is actually for the framework version 3.5. To fix this you need a gacutil for v4.0. You can find an updated gacutil at “c:\Program Files (x86)\Microsoft SDKs\Windows\v7.0A\Bin\NETFX 4.0 Tools” (this path is on your VS.NET box). Copy these to your deployment server and overwrite the version which shipped with the TaskUnzip version. More details on this step are here.
Conclusion
That’s it! By recompiling the DLL you can add it to the toolbox and then into your Control Flows in SSIS. We will be using this component to import our BI Aviation data, so stay tuned for more details on how that is done!
And in closing, please take a minute to leave your questions or comments in the space provided at the end of this post. I do appreciate your feedback!
Disclaimer: the mention of 3rd party or community products in these blog posts in no way constitutes a recommendation or advice. These are my personal opinion and do not reflect any of my employer’s opinions.