close
close

Unstructured Introduces Unstructured Serverless API: The Simplest, Fastest, and Most Cost-Effective Way to Make Enterprise Data AI-Ready

Unstructured Introduces Unstructured Serverless API: The Simplest, Fastest, and Most Cost-Effective Way to Make Enterprise Data AI-Ready

Unstructured Introduces Unstructured Serverless API: The Simplest, Fastest, and Most Cost-Effective Way to Make Enterprise Data AI-Ready
https://unstructured.io/blog/introducing-unstructured-serverless-api

The rapid evolution of AI and machine learning (ML) requires robust, scalable and efficient data processing solutions. Unstructured, a leading innovator in data transformation, presents its Unstructured serverless APIa revolutionary development aimed at simplifying, accelerating and reducing the costs of preparing business data for AI.

Introduction to Unstructured Serverless API

The unstructured serverless API represents the pinnacle of data processing technology, designed to make enterprise data ready for AI applications in a transparent and cost-effective manner. This new offering from Unstructured is poised to redefine data management with several key improvements:

  • New registration flow and admin dashboard: Improves user experience with simplified integration and efficient management tools.
  • Per page pricing model: This introduces predictable and reduced costs, allowing users to pay based on the number of pages processed.
  • Improved performance metrics: Achieves a 5x improvement in PDF processing throughput, 70% better table classification, 11% higher text accuracy, and 20% reduction in word error rate.

Benefits of Unstructured Serverless API

Improved transformation performance

The unstructured serverless API leverages next-generation document transformation models, delivering unprecedented performance improvements over its open source predecessors. The main benefits include:

  • Faster processing throughput: PDF processing is now five times faster.
  • Better classification of tables: The accuracy of table detection and structuring improved by 70%.
  • Higher text accuracy: Text extraction accuracy improved by 11%.
  • Reduced word error rate: The word error rate decreased by 20%.

These enhancements facilitate superior AI-powered workflows in three critical areas:

  1. Data cleaning: Developers can easily remove unwanted elements from the document, such as headers, footers or images, ensuring cleaner data for AI processing.
  2. Advanced segmentation strategies: Developers can manage and process document sections more efficiently by grouping documents based on elements such as titles.
  3. Metadata Filtering: Improves data retrieval by prioritizing the most relevant information in a file during queries.

Improved developer experience

Unstructured’s commitment to providing an exceptional developer experience is evident in the new features of its Serverless API:

  • Updated onboarding process: A streamlined registration process ensures a smooth start for new users.
  • New admin panel: Simplifies API key management and usage tracking.
  • Full documentation: The newly revamped documentation provides clear and detailed guidance to users.

These improvements make the Unstructured Serverless API powerful and user-friendly, promoting a more productive development environment.

Profitability and pricing model

The introduction of the Unstructured Serverless API brought about a significant change in the pricing model. By moving from a computing hours-based pricing model to a per-page pricing model, Unstructured provides more predictability and transparency:

  • Fast pipeline: Costs $1 for 1,000 pages.
  • High resolution pipeline: Costs $10 for 1,000 pages.

This new pricing structure significantly reduces costs, making it more economical for users to process large documents. For example, processing 1,000 PDF pages now costs $10, compared to $12.93 under the previous model.

Performance Improvements

The unstructured Serverless API delivers near-instant startup speeds and reduced latency, thanks to continuously online worker nodes that reduce startup times to less than three seconds, compared to thirty minutes. Document preprocessing pipelines are also optimized, processing documents five times faster using techniques such as document splitting for parallelized transformation.

Security and Compliance

Ensuring that businesses can trust Unstructured Serverless API for their most critical data workloads, Unstructured has achieved SOC 2 Type 2 compliance. This certification highlights security, availability, processing integrity , privacy, and API privacy controls.

Conclusion

The unstructured serverless API is set to transform the way businesses manage data for AI applications, combining unparalleled performance, cost efficiency and ease of use. By providing scalable, resilient and secure data processing solutions, Unstructured enables organizations to harness the full potential of AI.

Asif Razzaq is the CEO of Marktechpost Media Inc.. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of artificial intelligence for social good. Its most recent project is the launch of an artificial intelligence media platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news, both technically sound and easily understandable to a wide audience. The platform has more than 2 million monthly views, illustrating its popularity among the public.

(Gretel Navigator Announcement) Create, edit, and augment tabular data with the first compound AI system trusted by EY, Databricks, Google, and Microsoft