The Hanzo Blog
11 Jan 10
Hanzo at LegalTech NY 2010
Hanzo will be at LegalTech NY 2010, exhibiting in booth 536.
Visit us to find out why leading banks choose Hanzo to archive their websites for FINRA and SEC 17a-4 compliance, or why the worlds most successful brand use Hanzo to archive their online branding and promotions activities.
If you require FINRA or SEC compliance for your websites, or if your websites are included in your litigation readiness plans, visit Hanzo in booth 536 to find out how.
13 Nov 09
Hanzo takes archiving of websites to a forensic level
Bruce Wilson is a law, technology, and business development consultant whose experience includes leadership roles in business consulting, law and IT. Following our meeting at LegalTech NY in February, and a couple of conversations over this fall, Bruce has just published this great interview:
http://wilsonig.com/2009/11/12/forensic-archiving-and-search-of-web-2-0-sites/
Thanks Bruce!
01 Sep 09
Python Developers Wanted
Note to agencies: Do not contact us about this vacancy as we do not use agencies as a rule. If we ever do, we’ll contact you!
About the Company
Hanzo Archives Limited is a small, cutting-edge web archiving software and service company providing website e-discovery and compliance solutions to the corporate Global 500 market. Founded and currently operating in Europe, we are now set to expand into the North American markets following early commercial successes.
Job Summary
Reporting to the Chief Technical Officer, the python software developer will primarily help with the development of our advanced archival crawler, crawler tools, and the technical side of the archiving operations for customers. The software engineer will ensure high-quality software products are produced and we continue to deliver innovative and high-quality services. This will include writing software products and tools to help with these tasks.
How to Apply
Please forward a covering letter stating your interest and suitability along with your CV and contact details (such as home address, mobile number, email, IM/skype) to:
Interviews will be held in London in September/October 2009.
17 Jun 09
Job Vacancy: Contract Crawl Engineer
Hanzo are seeking bright, enthusiastic, self motivated software engineers with strong problem solving skills and who love a challenge and are enthusiastic about working on the delivery of our highly technical crawler operations and development of products. A proven ability in Python and Javascript and knowledge of the workings of the web is essential. Strong Unix or Linux skills including scripting with command line tools like Find, Grep and Awk will be important.
This vacancy is closed. Thanks!
25 Mar 09
Celebrating Ada Lovelace Day
Today I’d like to celebrate Ada Lovelace Day with a brief mention of these great women in technology:
- Kris Carpenter, Director, Web Archive
- Kristine Hanna, Director, Web Archiving Services (she was co-founder of GeekGirls too for goodness sake)
- Molly Bragg, Partner Specialist, Web Archiving Services
These creative and brilliant women work tirelessly to collect and preserve the public web for our good friends, and yours too incidentally, the Internet Archive, whose simple motto sums up their contribution to technology and society so well: “Universal Access to All Human Knowledge (for Free, for Ever).”
from Mark
17 Mar 09
World Wide Web of Humanities Presentation at University of Oxford
Mark Middleton of Hanzo will present Search and Analysis of Data in WWWoH at the “Humanities on the Web: Is it working?” workshop at the Tsuzuki Lecture Theatre, St Anne’s College, Oxford, on 19 March 2009. This presentation is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 2.0 UK: England & Wales License.
The presentation is a summary of our open source Search Tools project. A demo of the search tools is here.
Permissions beyond the scope of this license may be available, please contact us for more information.
12 Mar 09
Hanzo Archives show web archiving at LegalTech
From Hanzo Archives show web archiving at LegalTech « Chris Dale Lawyer Support
Short but absolutely on-the-button summary of Hanzo's selling message on Chris Dale's blog. Which, incidentally, is one of the few must-read blogs on e-discovery from the UK.
With web content now under greater scrutiny, web content collection and preservation for compliance and litigation should be tailored into records management practices. Hanzo's Webhold and Hanzo Enterprise products meets these requirements.
10 Feb 09
Hanzo at LegalTech NY 2009
Hanzo exhibited at LegalTech NY in Feb 2009. Here's what we learned:
- Lawyers, litigation support people, and records managers didn't know it was possible to archive websites in any way other than backup tapes
- Backup tapes are a nightmare when it comes to reviewing or producing website content
- These same people also didn't know it was possible to review and produce archived websites in native form, still browsable and searchable
- Intranet archiving is a HOT topic for the larger corporations
- Records managers are our friends
I hope to write more on these topics on our website and blog.
02 Dec 08
How Websites Differ From Other Electronic Files
Regarding my previous post, covering Judge Hedges decision:
From Hanzo Archives - Finding “No Reason to Treat Websites Differently than Other Electronic Files,†Court Grants Adverse Inference for Failure to Preserve Website : Electronic Discovery Law
I think it is important to recognise that while Judge Hedges is absolutely right from a legal perspective, it doesn't automatically follow from a technical perspective. This needs some explaining.
Websites are compound, complex, interconnected and hyperlinked collections of compound, complex, interconnected and hyperlinked documents. Lots of moving parts (and syllables).
This is quite different to other ESI. Consider, for example, an electronic file, such as a word or excel document, corresponds to a single document; an email is an envelope containing a single message, with metadata and attachments. A web-based document on the other hand, more often than not consists of many files: an html page, javascript code, style sheet(s), images, embedded media (possibly streaming), and links to different parts of the html file, other html files or documents, or other websites. Websites are not the same!
At the human level, Courts view web-based documents the same as your average human reader, i.e. the compound document described above, not the individual component parts. To preserve such documents in their native form, it is necessary to collect all the components parts correctly and store each and all of them unchanged.
As such, file-oriented methods for preservation are clearly inappropriate for websites.
30 Oct 08
Websites Are Like Any Other ESI
Re: Arteria Prop. Pty Ltd. v. Universal Funding V.T.O., Inc., 2008 WL 4513696 (D.N.J. Oct. 1, 2008):
From Finding "No Reason to Treat Websites Differently than Other Electronic Files," Court Grants Adverse Inference for Failure to Preserve Website : Electronic Discovery Law
This is a great decision for Hanzo customers. Here are the key issues raised:
- You are responsible for your website, no matter who maintains it or hosts it, it is your responsibility
- If you reasonably anticipate litigation you are required to preserve your website - "litigation hold"
This decision clearly underlines our product strategy.
We've designed our web archiving tool for exactly this scenario. As responsible owner of your websites, you should have records and information management policies in place to systematically archive your websites -- a "web archive". You can't rely on the developers or agencies involved in its development or hosting.
Hanzo archives any number of websites, from multiple URLs, CMS, databases and technologies, according to an agreed archive policy, and stores them in a secure, authenticating web archive. The web archive is an independent store for all your website content, enabling you to retain them according to your information management policies. This requires no additional effort by or consent from your developers, website designers, marketing agencies or hosting partners.
Secondly, the web archive provides a litigation hold for any or all of the web archive content. Moreover it is fully browsable, searchable and exportable, enabling discovery of your web resources in a fraction of the time taken using traditional preservation methods.
A more complete description of the case and the decision are on the Electronic Discovery Law blog.
21 Oct 08
Producer required to re-produce .TIFF documents in a “reasonably searchable” format
Producing web resources in native format can be burdensome. But we’ve changed that dramatically with our web e-discovery products. Don’t expect to get away with this anymore…
Hanzo uses client-side archiving technology, including proprietary web crawlers, API’s and plug-ins, that enable preservation of web resources in their native format: exactly the same format presented to browsers.
These resources are stored in archive files together with metadata verifying their authenticity. The archive files are ingested into a web archive and indexed. These can then be browsed the same way as the original website, along any captured timeline, and searched across full text, metadata and time.
More information on this is in our white paper “E-discovery: Why Archiving Your Web Presence is a Business Necessity”.
If a Web page, blog, thread in a customer forum, or your whole website were required by the courts, how would you be able to obtain the exact version required and present it as it was originally? How would you verify its authenticity? As regulations concerning corporate records and e-discovery proceedings are extended to include Web content, can you be certain you are compliant?
This white paper looks at compliance and e-discovery issues relating to Web content, and assesses the technologies you need to archive your Total Web Presence.
