Blog Posts

By Category
Oracle Technology
SAS
Data Migration
Data Warehouse
By Date
December 2007 (2)
Even if you don't use Microsoft Office 2007, you might have noticed more ".xlsx" files floating around lately. Perhaps you've been sent one or two that you can't open. XLSX is one of the new Microsoft Excel 2007 file formats. (Others include XLSB and XLSM.)

Like many software applications, SAS Enterprise Guide 4.1 cannot read these new file formats directly. The technology that we use to read Microsoft Excel files (and Microsoft Access files, which now sport an ".accdb" extension) was not updated to handle the fancy new formats produced by Microsoft Office.

One workaround is to ask your friends/colleagues/data suppliers to "please save these data files in a good old-fashioned XLS file that I can actually use." But if you cannot bear the shame of being "so, like, 2003-ish", there is another way to open these new files.

A reader from Bejing commented on a recent post with a question about data lengths and formats. While that wasn't really related to my post, I thought I'd attempt to answer in a new entry, here.

The question is basically this: when I combine two data sets with a common-named column, why does the resulting data set seem to cut the length short on the shared column?

Let's start with some definitions related to columns.


Continue reading "Lengths and formats: the long and short of it"