An architectural framework for software recovery of hardware faults




















Download [PDF ] [PPT ] Abstract As technology scales ever further, device unreliability is creatingexcessive complexity for hardware to maintain the illusion of perfectoperation. In this paper, we consider whether exposing hardware fault information to software and allowing software to control fault recovery simplifies hardware design and helps technology scaling. The combination of emerging applications and emerging many-core architectures makes software recovery a viable alternative to hardware-based fault recovery.

Fully managed environment for developing, deploying and scaling apps. Processes and resources for implementing DevOps in your org. End-to-end automation from source to production. Fast feedback on code changes at scale. Automated tools and prescriptive guidance for moving to the cloud. Program that uses DORA to improve your software delivery capabilities. Services and infrastructure for building web apps and websites. Tools and resources for adopting SRE in your org.

Add intelligence and efficiency to your business with AI and machine learning. Products to build and use artificial intelligence. AI model for speaking with customers and assisting human agents. AI-powered conversations with human agents. AI with job search and talent acquisition capabilities. Machine learning and AI to unlock insights from your documents. Mortgage document data capture at scale with machine learning.

Procurement document data capture at scale with machine learning. Create engaging product ownership experiences with AI. Put your data to work with Data Science on Google Cloud. Specialized AI for bettering contract understanding. AI-powered understanding to better customer experience. Speed up the pace of innovation without coding, using APIs, apps, and automation.

Attract and empower an ecosystem of developers and partners. Cloud services for extending and modernizing legacy apps. Simplify and accelerate secure delivery of open banking compliant APIs. Migrate and manage enterprise data with security, reliability, high availability, and fully managed data services. Guides and tools to simplify your database migration life cycle. Upgrades to modernize your operational database infrastructure. Database services to migrate, manage, and modernize data.

Rehost, replatform, rewrite your Oracle workloads. Fully managed open source databases with enterprise-grade support. Unify data across your organization with an open and simplified approach to data-driven transformation that is unmatched for speed, scale, and security with AI built-in. Generate instant insights from data at any scale with a serverless, fully managed analytics platform that significantly simplifies analytics.

Digital Transformation Accelerate business recovery and ensure a better future with solutions that enable hybrid and multi-cloud, generate intelligent insights, and keep your workers connected. Business Continuity. Proactively plan and prioritize workloads. Reimagine your operations and unlock new opportunities. Prioritize investments and optimize costs.

Get work done more safely and securely. How Google is helping healthcare meet extraordinary challenges. Discovery and analysis tools for moving to the cloud. Compute, storage, and networking options to support any workload. Tools and partners for running Windows workloads. Migration solutions for VMs, apps, databases, and more. Automatic cloud resource optimization and increased security.

End-to-end migration program to simplify your path to the cloud. Ensure your business continuity needs are met. Change the way teams work with solutions designed for humans and built for impact. Collaboration and productivity tools for enterprises. Secure video meetings and modern collaboration for teams. Unified platform for IT admins to manage user devices and apps. Enterprise search for employees to quickly find company information.

Detect, investigate, and respond to online threats to help protect your business. Solution for analyzing petabytes of security telemetry. Threat and fraud protection for your web applications and APIs. Solutions for each phase of the security and resilience life cycle.

Solution to modernize your governance, risk, and compliance function with automation. Data warehouse to jumpstart your migration and unlock insights. Services for building and modernizing your data lake. Run and write Spark where you need it, serverless and integrated. Insights from ingesting, processing, and analyzing event streams. Solutions for modernizing your BI stack and creating rich data experiences. Solutions for collecting, analyzing, and activating customer data.

Solutions for building a more prosperous and sustainable business. Data from Google, public, and commercial providers to enrich your analytics and AI initiatives. Accelerate startup and SMB growth with tailored solutions and programs. Get financial, business, and technical support to take your startup to the next level. Explore solutions for web hosting, app development, AI, and analytics. Build better SaaS products, scale efficiently, and grow your business.

In this paper, we consider whether exposing hardware fault information to software and allowing software to control fault recovery simplifies hardware design and helps technology scaling. The combination of emerging applications and emerging many-core architectures makes software recovery a viable alternative to hardware-based fault recovery. To ensure an application is resilient to end-to-end failures, it is essential that all fault-points and fault-modes are understood and operationalized.

To learn more, see Failure mode analysis for Azure applications. Eliminate all single point of failure. A single point of failure describes a specific fault-point which if it where to fail, would bring down the entire application. Single points of failure introduce significant risk since any failure of this component will cause an application outage. To learn more, see Make all things redundant. Eliminate all singletons. A singleton describes a logical component within an application for which there can only be a single instance.

It can apply to stateful architectural components or application code constructs. Ultimately, singletons introduce a significant risk by creating single points of failure within the application design. Internal dependencies describe components within the application scope which are required for the application to fully operate. External dependencies capture required components outside the scope of the application, such as another application or third-party service.

Dependencies may be categorized as either strong or weak based on whether or not the application is able to continue operating in a degraded fashion in their absence.



0コメント

  • 1000 / 1000