Geeks With Blogs
Josh Reuben Cloud
Hive - HQL query over MapReduce
Overview Developed by Facebook HiveQL is a SQL-like framework for data warehousing on top of MapReduce over HDFS. converts SQL query into a series of jobs for execution on a Hadoop cluster. Organizes HDFS data into tables - attaching structure. Schema on Read Versus Schema on Write - doesn’t verify the data when it is loaded, but rather when a query is issued. full-table scans are the norm and a table update is achieved by transforming the data into a new table. HDFS does not provide in-place file ......

Posted On Tuesday, March 22, 2016 5:32 AM

Scalable Service Architecture
Overview A scalable cloud based architecture has converged upon a common design meta-pattern leveraging the following components, which are explained below: A Forward-Cache Proxy An elastic service Load Balancer Stateless Server Facade Nodes A Shared Memory State Cache Client requests hit the Forward-Cache Proxy which returns any memoized service requests if it can. If a new request signature passes the forward cache proxy, it hits the Load Balancer which dynamically allocates the request to one ......

Posted On Friday, March 13, 2015 5:03 AM

Azure Workflow
Workflow 1.0 beta provides new Azure functionality: – a scale-out enabled infrastructure for hosting Workflows installation on-premises, or on Azure roles You can view my full presentation on this here: I’ll bet this is released around Build Conference. Below is a short summary: Workflow Farm New distributed node managed host infra for executing workflows Scale-out load-balanced ......

Posted On Saturday, August 18, 2012 9:53 AM

Daytona - Iterative MapReduce on Windows Azure
Daytona - Iterative MapReduce on Windows Azure Overview MapReduce is a framework for processing highly distributable problems across huge datasets using a large number of compute nodes. It is a generic mechanism that comprises 2 steps: Map step: The master node takes the input, partitions it up into smaller sub-problems, and distributes them to worker nodes. The worker node processes the smaller problem, and passes the answer back to its master node. Reduce step: The master node then collects the ......

Posted On Thursday, December 8, 2011 7:26 AM

The Windows Azure HPC Scheduler SDK
Overview Windows HPC Server 2008 is infrastructure for high-end applications that require high performance computing clusters – i.e. for scaling out parallelizable across many compute nodes in a grid. These compute nodes can be coordinated by a head node , which in turn can be proxied via a service broker node that exposes a SOA WCF interface for job scheduling. Additional functionality includes the ability to coordinate between job processes running on nodes via MPI (message passing interface). ......

Posted On Tuesday, December 6, 2011 6:36 AM

Azure Grid Computing - Worker Roles as HPC Compute Nodes
Overview · With HPC 2008 R2 SP1 You can add Azure worker roles as compute nodes in a local Windows HPC Server cluster. · The subscription for Windows Azure like any other Azure Service - charged for the time that the role instances are available, as well as for the compute and storage services that are used on the nodes. · Win-Win ? - Azure charges the computer hour cost (according to vm size) amortized over a month – so you save on purchasing compute node hardware. Microsoft wins because you need ......

Posted On Tuesday, February 22, 2011 10:12 AM

Azure Futures - Distributed Computing and Number Crunching
"the biggest Azure customers today are the ones using HPC on-premises at the current time" - Orleans Framework for cloud computing - HPC on Azure - ......

Posted On Thursday, February 3, 2011 7:41 AM

Copyright © JoshReuben | Powered by: | Join free