Searching Binary Data in SQL Server

November 1, 2012

More and more of the data that organizations collect is stored in encoded, binary formats: images, audio files, video, and even common formats like Word and Excel files. These files hold a lot of important information, but the formatting must be stripped out before users can search them effectively.

This presentation starts with a discussion of the three types of data in SQL Server to set the framework. It demos and explains:

  • structured data
  • semi-structured data
  • unstructured data

The talk then looks at how unstructured data is stored in SQL Server, with a brief look at FILESTREAM and FileTable.

There is a short discussion of full-text search, with a look at the changes in SQL Server 2012, before moving on to the iFilter interfaces, which are used to search binary data while ignoring the encoding. There are demos of basic CONTAINS and FREETEXT searches, along with some of the more advanced options, such as customizable NEAR and weighting of search terms.
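To give a flavor of the searches mentioned above, here is a minimal T-SQL sketch. It is not the code from the talk's download. The table dbo.Documents, its columns, the key index PK_Documents, and the catalog name are all hypothetical, and the sketch assumes full-text search and the relevant iFilters are installed on the instance.

```sql
-- Hypothetical table: Content holds the binary file, FileExtension (e.g. '.docx')
-- tells full-text search which iFilter to use when extracting the text.
CREATE TABLE dbo.Documents
(
    DocumentId    INT IDENTITY(1,1) NOT NULL
        CONSTRAINT PK_Documents PRIMARY KEY,
    Title         NVARCHAR(200)  NOT NULL,
    FileExtension NVARCHAR(10)   NOT NULL,
    Content       VARBINARY(MAX) NOT NULL
);

-- Hypothetical catalog name.
CREATE FULLTEXT CATALOG DocumentCatalog AS DEFAULT;

-- TYPE COLUMN links the binary column to the column holding the file extension.
CREATE FULLTEXT INDEX ON dbo.Documents
    (Content TYPE COLUMN FileExtension LANGUAGE 1033)
    KEY INDEX PK_Documents;

-- CONTAINS: precise matching on words or phrases.
SELECT DocumentId, Title
FROM dbo.Documents
WHERE CONTAINS(Content, N'"binary" AND "search"');

-- FREETEXT: looser matching on the meaning of the search string.
SELECT DocumentId, Title
FROM dbo.Documents
WHERE FREETEXT(Content, N'searching binary data');

-- Customizable NEAR (SQL Server 2012): both terms within 5 terms
-- of each other, in the order given.
SELECT DocumentId, Title
FROM dbo.Documents
WHERE CONTAINS(Content, N'NEAR((filestream, filetable), 5, TRUE)');

-- Weighted terms: CONTAINSTABLE returns a RANK that reflects the weights.
SELECT d.DocumentId, d.Title, k.[RANK]
FROM dbo.Documents AS d
JOIN CONTAINSTABLE(dbo.Documents, Content,
        N'ISABOUT (filestream WEIGHT (0.8), filetable WEIGHT (0.2))') AS k
    ON d.DocumentId = k.[KEY]
ORDER BY k.[RANK] DESC;
```

The index is populated asynchronously, so queries run immediately after creating it may return no rows until population finishes.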

The talk finishes with a short look at the new semantic search feature in SQL Server 2012.

Level: 200

Length: 60 minutes

Downloads: PPTX, code

Presentation Schedule:

June 1, 2013 – SQL Saturday #200 – Philadelphia

May 3, 2013 – SQL Bits XI

April 7, 2013 – SQL Saturday #197 – Omaha

October 30, 2012 – SQL Connections, Fall 2012
