SQL Server KBs and Blogs

Quick access to blogs posts from SQL Server Support Team., Some of the new KBs published for SQL and SQL Content news

Browse by Tags

Related Posts
  • Blog Post: How to add custom Hive UDFs to HDInsight

    I recently had a need to add a UDF to Hive on HDInsight. I thought that it would be good to share that experience on a blog post. Hive provides a library of built-in functions to achieve the most common needs. The cool thing is that it also provides the ...read more
  • Blog Post: How to pass Hadoop configuration values for a job on HDInsight

    I came across the question a few times recently from several customers– "how do we pass hadoop configurations at runtime for a mapreduce job or Hive Query via HDInsight PowerShell or .Net SDK?" I thought of sharing the answer here with ...read more
  • Blog Post: Using Apache Flume with HDInsight

    Gregory Suarez – 03/18/2014 (This blog posting assumes some basic knowledge of Apache Flume) Overview When asked if Apache Flume can be used with HDInsight, the response is typically no. We do not currently include Flume in our HDInsight ...read more
  • Blog Post: Customizing HDInsight Cluster provisioning

    In my last blog , I discussed how we can specify Hadoop configurations for a job on an HDInsight cluster. At the end of that blog, I also dicussed the alternative approach where you may want to change certain hadoop configurations from default values ...read more
  • Blog Post: Querying HDInsight Job Status with WebHCat via Native PowerShell or Node.js

    One of the great things about HDInsight is that under the covers, it has the same capabilities as other Hadoop installations. This means that you can use regular Hadoop endpoints like Ambari and WebHCat (formerly known as Templeton) to interact with an ...read more
  • Blog Post: Sliding Window Data Partitioning on Microsoft Azure HDInsight

    HCatalog is a table and storage management layer for Hadoop that enables users with different data processing tools like Pig, Mapreduce, Hive, and Oozie to read and write data. HCatalog's table abstraction presents these tools and users with a relational ...read more
  • Blog Post: HDInsight: - Creating, Deploying and Executing Pig UDF

    During my developer experience, I always look for how customization (write my own processing) can be done if functionality is not available in programming language. That thought was triggered again when I was working on Apache Pig in HDInsight. So I started ...read more
  • Blog Post: How to use parameter substitution with Pig Latin and PowerShell

    When running Pig in a production environment, you'll likely have one or more Pig Latin scripts that run on a recurring basis (daily, weekly, monthly, etc.) that need to locate their input data based on when or where they are run. For example, you ...read more
  • Blog Post: How to use HBase Java API with HDInsight HBase cluster, part 1

    Recently we worked with a customer, who was trying to use HBase Java API to interact with an HDInsight HBase cluster. Having worked with the customer and trying to follow our existing documentations here and here , we realized that it may be helpful if ...read more
  • Blog Post: Some Commonly Used Yarn Memory Settings

    We were recently working on an out of memory issue that was occurring with certain workloads on HDInsight clusters. I thought it might be a good time to write on this topic based on all the current experience troubleshooting some memory issues. There ...read more
  • Blog Post: Azure PowerShell 0.8.14 Released, fixes problems with pipelining HDInsight configuration cmdlets

    We recently pushed out the 0.8.14 release of Azure PowerShell . This release includes some updates to the following cmdlets to ensure that values passed in via the PowerShell pipeline, or via the -Config parameter, are maintained: Set-AzureHDInsightDefaultStorage ...read more
  • Blog Post: Leveraging a Hadoop cluster from SQL Server Integration Services (SSIS)

    With the explosion of data, the open source Apache™ Hadoop™ Framework is gaining traction thanks to its huge ecosystem that has arisen around the core functionalities of Hadoop distributed file system (HDFS™) and Hadoop Map Reduce. As of today, being ...read more