Cloud Computing Resources for Small and Midsize Businesses

Cloud Computing for SMBs

Subscribe to Cloud Computing for SMBs: eMailAlertsEmail Alerts newslettersWeekly Newsletters
Get Cloud Computing for SMBs: homepageHomepage mobileMobile rssRSS facebookFacebook twitterTwitter linkedinLinkedIn


SMB Cloud Authors: Yeshim Deniz, Harry Trott, Breaking News, Breaking New, Sarah Patrick

Related Topics: Cloud Computing, SOA & WOA Magazine, Big Data on Ulitzer

Blog Feed Post

Big Data versus Small Data

How do you know whether you are dealing with Big Data or Small Data?

How do you know whether you are dealing with Big Data or Small Data? I’m constantly asked for my definition of “Big Data”. Well, here it is…for batch analytics.

Big-Small

Batch Analytics

Batch Analytics Small Data Big Data
Data Volume Gigabytes Terabytes – Petabytes
Data Sources 1-6 (structured – SQL, or unstructured – NoSQL) 6+ structured AND 6+ unstructured
Business Functions One line of business (e.g. sales) Several lines of business all the way up to a 360 degree view of the business
Business Questions Queries are complex requiring many concurrent data modifications, a rich breadth of operators, and many selectivity constraints. However, they are applied to a simpler data structure.Example: determine how much sales is made on a given line of parts, broken out by supplier, by geography, by year.

 

 

Queries are complex requiring many concurrent data modifications, a rich breadth of operators, and many selectivity constraints. Queries span across business function.Example: determine how much profit (sales – COGs – allocated operating expense) is made on a given line of parts, broken out by supplier, by geography, by year; and then determine which customers purchased the higher profit parts, by geography, by year; determine the profile (demographics) of those high-profit customers; find out what products purchased by those high-profit customers were NOT purchased by other similar customers in order to cross-sell / up-sell.

More ad hoc and interactive analytics next….

Read the original blog entry...

More Stories By Jim Kaskade

Jim Kaskade currently leads Janrain, the category creator of Consumer Identity & Access Management (CIAM). We believe that your identity is the most important thing you own, and that your identity should not only be easy to use, but it should be safe to use when accessing your digital world. Janrain is an Identity Cloud servicing Global 3000 enterprises providing a consistent, seamless, and safe experience for end-users when they access their digital applications (web, mobile, or IoT).

Prior to Janrain, Jim was the VP & GM of Digital Applications at CSC. This line of business was over $1B in commercial revenue, including both consulting and delivery organizations and is focused on serving Fortune 1000 companies in the United States, Canada, Mexico, Peru, Chile, Argentina, and Brazil. Prior to this, Jim was the VP & GM of Big Data & Analytics at CSC. In his role, he led the fastest growing business at CSC, overseeing the development and implementation of innovative offerings that help clients convert data into revenue. Jim was also the CEO of Infochimps; Entrepreneur-in-Residence at PARC, a Xerox company; SVP, General Manager and Chief of Cloud at SIOS Technology; CEO at StackIQ; CEO of Eyespot; CEO of Integral Semi; and CEO of INCEP Technologies. Jim started his career at Teradata where he spent ten years in enterprise data warehousing, analytical applications, and business intelligence services designed to maximize the intrinsic value of data, servicing fortune 1000 companies in telecom, retail, and financial markets.