Historically, search engines have been unable to extract content, such as text and links, from Flash (SWF) files. Subsequently, much of the Flash-based content on the web has been unavailable in search results. This situation has been frustrating for web developers, who have tried to come up with workarounds to help get search engines to index and rank their Flash pages.
This situation hasn’t been ideal for searchers either, as this limitation has kept them from seeing potentially great matches for their queries because they’ve been locked away in Flash files.
According to Adobe and Google, all of that is changing. Google is launching what they they say is a “deep algorithmic change,†augmented by Flash reader technology supplied by Adobe, that enables them to “read†Flash files and extract text and links from it for better indexing and ranking. This could be great news for both site owners and searchers.
Google has been working on improving how they crawl and index rich content (such as Flash and JavaScript) for some time, and in fact have been able to extract some text and links from Flash files for a while. Adobe makes Google’s algorithms “less error prone†and enables them to access content created in any version of Flash in a variety of languages.
Adobe says they have developed an optimized Flash player for search engines and are collaborating with both Google and Yahoo!. Yahoo! has not yet implemented the technology, although they stated that “Yahoo! is committed to supporting webmaster needs with plans to support searchable SWF and is working with Adobe to determine the best possible implementation.†Adobe hasn’t made the technology available to Microsoft’s Live Search, although they say they are “exploring ways to make the technology more broadly available†to “help make all SWF content more easily searchable.†A quick look at how SWF files are currently indexed shows that there’s a lot of room for improvement, so this may indeed be big news for search.

Flash has often been a source of frustration for SEOs who argue that text should be in HTML, with Flash used for non-textual content, such as video illustrations. Can SEOs now remove the “review Flash implementation†line from their checklists? Probably not. However, it should be easier for SEOs to work with Flash-based sites going forward.
You should keep in mind that these new algorithms don’t take into account any meta data or formatting markup in the Flash file and, for now, Google’s cache won’t show a representation of the extracted text so site owners can’t verify what is actually being crawled by viewing the cached copy. In addition, since Googlebot doesn’t execute most JavaScript, Google won’t crawl or index any Flash executed via JavaScript. Any external sources that the Flash file loads will be indexed separately, rather than as part of the Flash file. And as noted earlier, all non-textual content will remain uncrawled. This new Flash support covers all languages other than bidirectional ones (Hebrew and Arabic) and all versions of Flash.
Labels: Flash, HowTo, SEO