Sitecore – Dean Thrasher

Six Key Technology Themes for 2021

October 19, 2020 dthrasherLeave a comment

For the past few months, I’ve led the EPAM Sitecore Competency Center. It’s been interesting times, to say the least! Much of 2020 was devoted to learning my new role as global head of the Sitecore CC and responding to the Covid-19 pandemic.

Both of these were largely focused internally — but I do hope to write a few posts about some of the interesting accelerators and proofs-of-concept that my fellow EPAMers created during those slow, uncertain months earlier this year. (We’ll showcase several of these at Sitecore Symposium next week.)

But for this post, I’d like to focus on the future, and the 6 themes that will drive digital platforms in 2021:

Decoupled digital systems
Cloud-native architecture
Microservices and API-first development
Optimized digital journeys
Applied testing and analytics
Unified experience profiles

Decoupled digital systems

Decoupling is a strategy in software architecture and code to separate concerns. By splitting features and functions into logically and/or physically separate routines, you can evolve these pieces of a system or program independently. This often improves scalability and maintenance. For content management systems, this means adopting the old idea of the two-stack CMS — treating the management application differently from the content delivery application.

This means you have the ability to tailor experiences for specific devices and for specific purposes. Decoupling also helps reduce the dependencies, which can help speed delivery.

The most extreme form of this can be found in the new wave of “headless” CMS providers, like Contentstack and Contentful. These focus on the management and administration of content through the use of well-defined APIs. But the “head” of the CMS — the delivery side — is all up to you. Sitecore and other CMS platforms are taking a less extreme approach. They provide a head — along with many of the advanced features that come with it — but decouple it from the back-office management application.

Whether you find a truly headless platform or merely a decoupled one a better fit for your needs, it’s clearly the driving theme in digital applications at the moment.

Cloud-native architecture

Many organizations have been surprised to find the benefits of moving to cloud hosting were not what they expected. McKinsey does a great job deconstructing these myths about the cloud. The core insight is that if you treat the cloud simply as “someone else’s datacenter”, you won’t get many of the improvements touted for cloud migration. To reap the benefits, you have to redesign your systems to take advantage of the flexibility that cloud architectures provide. “Lift and shift” is just a first step.

This applies not only to your own organization’s systems, but to many of the software platforms and services you’ve relied on. Most of these will move to a Software-as-a-Service model, and that means designing your systems as loosely connected independent services. That means less code, more configuration, and more integration.

Microservices and API-first development

API-first development underpins my first two themes. By separating concerns into distinct domain services, each with their own API contracts, you improve re-usability and add flexibility to your systems. These small web services can then be scaled and tuned for performance independently of each other.

So the first three trends mutually reinforce each other. Decoupling will lead you to making smaller, more focused applications and services. Standardizing the APIs for these will allow these services to be consumed by and composed into new offerings. And cloud-native architecture will allow these pieces to flex to meet demand and performance targets.

The first three themes were technical; the next three themes have to do with customer experience.

Optimized digital journeys

The recent pandemic underscored the primacy of digital channels for many organizations. And it’s clear that consumers and constituents reward organizations that provide the best digital experiences — through increased sales, positive reviews, and loyalty among other measures. But how can we provide a better digital experience?

The short answer is: figure out what a visitor is trying to do today and make it easy for them to do it. I emphasize the “today” in the last sentence, because often organizations undertake massive data mining operations to study past behavior or try to apply AI or advanced statistical techniques — or perhaps ignore the issue altogether because these projects seem too expensive or complicated! It doesn’t have to be either one of these extremes. It can start with the first question every human customer service representative is trained to ask, “What can I help you with today?”

You can ask this question of your visitors explicitly, or you can use other signals to determine what a visitor is trying to do — like which link an an email the visitor clicked to bring them to the site. Or which social media post they followed. Or what article they landed on through organic search. All of these are useful signals that you can use to help guide and personalize their journey in real time.

And the metrics are also easy to gather. How long did it take them to register an account, check out their cart, or fill out the form? After engaging in one of these journeys, did they respond to your survey? Did they share the article with friends? The chances are, your organization is already collecting much of the information you’d need to optimize the important journeys on your site. Often what organizations lack is time to reflect on these metrics and the know-how required to conduct small experiments. Which leads us to our next theme.

Applied testing and analytics

There’s a big difference between organizations that collect analytics to report status and those that actually apply it to drive improved business outcomes. And 2021 is the year you’ll see the latter kind of organization pull away from the former, due to the disruption caused by the pandemic.

Consumers have had to break a lot of their old habits and adopt new ones. So the companies that can ask smart, targeted questions about critical moments in the customer journey — and then take action on their findings — will distinguish themselves in the marketplace.

There are a range of tools and techniques you can apply. You can do A/B or multivariate tests, measure click-through rates, perform path analysis, conduct focus groups, or provide customer satisfaction surveys. But the tools are less important than being curious and asking interesting questions about the touch-points that serve your customers’ needs.

Unified experience profiles

Once you’ve embraced an experimental mindset and have begun making optimizations, you might be ready for the next step — creating a unified experience profile across all of your digital touch-points.

There are many reasons why a unified, 360-degree view of the customer is beneficial to your organization. One is that it enables you to serve your customers better. For example, your sales team will know what customer support told the customer. When a customer updates profile data, all your systems can see that change. Another is that you can derive new insights about your customers by viewing their behaviors across different channels. And a final reason is that you can better manage sensitive personal information within a comprehensive system than by scattering across numerous back-office systems.

This is a tall order, given the number of functions within your enterprise that rely on this data. Implementing a comprehensive view of customer data is a multi-year effort. But what makes this a powerful theme for 2021 is both the pandemic, which has emphasized digital remote work and de-emphasized paper and face-to-face transactions, as well as privacy and confidentiality regulations worldwide.

Many organizations are well on their way to implementing a 360-view. But harnessing its benefits — not simply reacting to events — will lead to many interesting opportunities.

Summing up

These 6 themes form the new digital playbook that many leading-edge organizations follow as they launch and scale their enterprises. The enterprise solutions I’m working on now involve many, if not all, of these themes.

We’ll be talking about these themes at Sitecore Sympsoium. I’m curious about whether these themes will resonate with others in the Sitecore community and in the wider digital ecosystem.

Sitecore MVP 2020

February 2, 2020 dthrasherLeave a comment

I’m glad to be part of the Sitecore MVP community again! Sitecore announced their MVPs for 2020 last week, so I’m once again a man with a badge: a Sitecore Commerce badge.

This is the second time I’ve been a Sitecore MVP, my first award having been for Commerce in 2018. This year, I’ll be continuing my work with the Sitecore Commerce – Microsoft D365 Retail connector as well as getting involved in my company’s Sitecore Commerce – Hybris connector.

In addition to the ERP connectors we’ve been building, EPAM also released its headless commerce accelerator as open source just before Sitecore Symposium. I think there’s a lot of potential to build on each of these efforts.

EPAM had 14 awardees this year, so I’m in good company. (In both senses of the word!) But I’m especially pleased that my co-presenter of the D365 Connector at Symposium 2019, Vsevolod (Seva) Kolonistov, is also a technology MVP for 2020 — the only one in Russia, and the organizer of the St. Petersburg Sitecore User Group.

It’s going to be an exciting 2020!

Sitecore Symposium 2019 Customer Showcase

November 23, 2019 dthrasherLeave a comment

Every year at Sitecore Symposium, Sitecore features customer success stories. This year, I had the honor to co-present the relaunch of the Lowe’s Canada website with Tanbir Grover, VP of Digital and Omnichannel at Lowe’s Companies Canada.

Sitecore Symposium 2019 customer showcase — On the big stage at Sitecore Symposium 2019

In August 2019, after nearly 3 years of work (if you include the the first proof-of-concept in 2016), the new website went live across Canada. The results so far have been very promising, especially for mobile ecommerce. Tanbir was able to share some impressive statistics. More important than the numbers, though, is the improvements in flexibility and agility. The retail sector has been reinventing itself, and now the digital team at Lowe’s Canada can respond to changing trends and innovate in key areas such as content marketing, personalization, and search.

I wish we’d been able to put the whole team on stage — from Lowe’s Canada and EPAM. This project would not have succeeded without a lot of very smart people from around the world solving difficult commerce problems at scale. I’ve very proud to have been a part of that team.

Sitecore Solr Cloud Support

May 16, 2018May 16, 2018 dthrasherLeave a comment

On the roadmap for future releases of Sitecore 9 is true SolrCloud support. This has been a sticking point with many scaled implementations of Sitecore since SolrCloud was first introduced in 2013. For the most part, Sitecore implementations have relied upon either the Solr Master-Slave model to ensure high availability and load balancing (at least for queries) or have muddled through with Solr Cloud approximations.

A history lesson

To understand where Sitecore stands with regard to Solr Cloud support, you have to know a little history behind how search was implemented in Sitecore. There are two mechanisms for finding items in Sitecore. The first is to use Sitecore’s database query mechanisms, which rely on an XPath-like syntax for traversing the content tree. This is a slow method, but is used for simple queries to traverse parent-child relationships and is often used with Sitecore’s field types that display hierarchical relations: the treelist, droplist, and droptree, for example.

The second method is to use a full text search provider. Back in the Sitecore 4 or 5 days, this was dtSearch, but Lucene quickly became the default full text search provider, due to the extensive documentation and examples available. Like dtSearch, Lucene was a full-text indexing engine meant to be used in embedded applications. These tools worked great on a developer machine, or on a single-sever “all-in-one” demo configuration, but caused issues when deploying into a multi-server environment. Since each server maintained its own index, issues with out-of-sync indexes were extremely common.

Solr, an extension of the Apache Lucene project, was introduced to solve this scalability issue. It offered an enterprise platform that used the Lucene API and supported many of the extensions and plugins developed by that community, and allowed indexing and search functions to run on a separate instance. Since the API was largely compatible with Lucene, it was easy for Sitecore and other CMS platforms to transition to Solr.

Solr scalability models

The early approach to Solr scalability was the master-slave model. One server in a Solr cluster — the master — performs all the write operations, but any server in the cluster can respond to queries. This approach is one of eventual consistency for read operations. If a slave fails, the load can be redistributed to the other members of the cluster. If the master fails, however, no new write operations can be performed until one of the slaves can be reconfigured to act as the new master.

Master-slave was a challenge for Sitecore for two reasons.

Sitecore assumes that its indexing and search server shares a single URL. But really, it should know of two URLs: the load-balanced URL used for reads, and the single URL that points to the primary Solr server for writes.
To recover from a master failure, manual intervention was required to promote a slave to a master. This requires a reboot of the slave Solr server, which is why a true high-availability Solr master-slave arrangement should really have been called a “master-slave-slave” configuration, and needs to contain a minimum of three instances.

To address the first problem, you need to configure your load balancer to spread all GET requests across all of your master and slave instances, but to route POST requests only to the master server. Or you need to modify the Sitecore code and configuration to separate writes from reads, and send each to a different URL. (The second approach may be required if your query parameters become so complex that you run into the GET request character limit for the query string.)

To address the second problem — which was an issue for all systems using Solr, not just Sitecore — the Solr project introduced SolrCloud.

SolrCloud has nothing to do with cloud computing, despite its name. You can have on-premise datacenter deployments of SolrCloud. It’s just the name given to the new high availability and scalability model that replaces the old master-slave-slave approach.

In a SolrCloud, ZooKeeper nodes are used to determine which Solr instance acts as the primary instance responsible for write operations. If the primary fails, one of the secondaries is promoted to act as the new primary automatically. There may be a few write errors as the problem gets detected, a new election of a primary occurs, and the new primary takes over write responsibilities, but this is generally a short interval. No manual intervention is required.

This is a great approach, except for one problem: Sitecore (and the Solr libraries it depends on) don’t understand ZooKeeper.

Who’s in charge of this zoo?

In the Java world, the Solr4j library has had support for ZooKeeper “baked-in” so that clients don’t have to figure out which Solr node is the primary. The CloudSolrClient in the Solr4j library client communicates with the Zookeeper nodes to discover Solr endpoints. For write operations, it helps determine which Solr instance will handle the write requests, and all index update operations are transparently sent to that instance. For read operations, the CloudSolrClient uses the LBHttpSolrClient class as a software load balancer. There’s no additional work required!

But Sitecore is a Microsoft.NET application, and its ContentSearch API ultimately relies upon a library called Solr.NET. Solr.NET is not maintained by the Apache Solr project itself, but by a group of open source developers. As a result, Solr.NET support for Solr versions often lag behind that of the Java client libraries. This is why Sitecore only supports up to the Solr 6 series, even though Solr 8 will be released very soon.

And for a long time, Solr.NET didn’t fully support Solr 6 — which is why it didn’t have the CloudSolrClient baked in, or any knowledge of how to communicate with Zookeeper. But as of January 2018, developers can finally get a version of Solr.NET with support for SolrCloud and versions up to Solr 7. There’s even a SolrNet.Cloud NuGet package.

Now all we have to do is wait for Sitecore to incorporate it into the .NET platform. What about it Sitecore? Can we have it in time for Symposium this year? Or at least by Christmas?

Sitecore MVP 2018

April 12, 2018February 2, 2020 dthrasherLeave a comment

I was pleased and excited to be named a Sitecore Commerce MVP for 2018! While I have worked with Sitecore for many years, this is the first time I have earned an MVP award.

The official announcement is here: https://www.sitecore.com/company/press-and-media/press-releases/2018/01/sitecore-names-its-2018-most-valuable-professionals

Typically, I’m so busy with client implementations, I don’t have much time to focus on social media and knowledge sharing, apart from my work with the DC Sitecore User Group. But during 2017, I had the opportunity to work closely with a major retail client and the Sitecore product development team for their Sitecore Connect for Microsoft Dynamics 365 module. It’s been an awesome experience working directly with Sitecore on fast-moving Microsoft tech in the Commerce space.

For more details about the Sitecore MVP program, and a breakdown of statistics regarding the awardees for this year, see: https://www.sitecore.com/company/blog/521/announcing-the-2018-sitecore-mvp-awards-4508

SCpbMD Catalog Sync Explained

November 27, 2017 dthrasherLeave a comment

I’ve been meaning to post this diagram for a while. I’ve used this to explain the Sitecore Commerce catalog data sync operation to at least three different clients since I drafted it this summer. And although I created it for the Microsoft Dynamics 365 version of the connector, it’s similar for Dynamics AX and other PIM systems as well.

In the D365 box, you have the UI application, which admins can use to publish catalog data once it has been validated. There are a lot of elements that must be configured an working correctly to have a valid catalog, but a few of the key pieces are:

An online navigation hierarchy for your online channel
An assortment for your online channel containing released products
A catalog associated with the online channel
Products assigned to nodes on the online navigation hierarchy
Product attributes defined and attached to nodes on your online navigation hierarchy

Once the catalog is published, it’s ready to go as far as the “headquarters” database is concerned. But in Dynamics AX / D365, it also has to be distributed — sent to the online channel database using distribution jobs.

Once the catalog is in the online channel database, the D365 Retail Server can read from it. External applications can read catalog data from the online channel database using the Retail Server APIs, a curious mix of web services that aren’t quite WCF and aren’t quite REST.

This is where the Sitecore part of the picture comes into play. Sitecore provides a sample console application that uses Sitecore’s Data Exchange Framework to fetch data from the Dynamics Retail server. It transforms it into an XML file that can then be imported into Sitecore’s Commerce Server. (Sitecore 9 also uses this catalog.xml file format, though the old commerce server components are no longer used.)

This places the product and category definitions and data into the product catalog database. This product catalog database acts as an “edge cache” that keeps just the products the site will use close to the infrastructure of the website itself. It provides some redundancy in case that communications problems occur between Sitecore and D365.

The last step in the process is the catalog data provider. Sitecore XP uses a data provider to access the product catalog database, creating virtual Sitecore items that appear within the Sitecore UI. Product and category data are not stored as “real Sitecore items” in the sense that they live in the standard Sitecore master or web databases.

Watch the arrows!

Note the color and direction of the arrows in the diagram above. The orange arrows are the ones controlled by the Sitecore console app and Data Exchange Framework. The arrows in blue are either part of standard D365 functionality or belong to extensions to the Sitecore platform. The orange arrows could have been labeled “Extract, Transform, and Load” because that’s exactly the operations performed by the catalog sync. (If I ever redraw the diagram, I might update the labels to say just that!)

The direction of the arrows are important, too. Catalog information must be sent from AX HQ to the channel database by those batch distribution jobs. If those jobs aren’t running, then no updates occur in the channel DB, and no updates will be returned in by the Retail Server API.

Once the data is available at the Retail Server, it’s up to the Sitecore catalog sync process to fetch the latest data from the Retail Server. This can be run manually or as a scheduled job, but note that there are no notifications here — D365 doesn’t push the data to Sitecore, Sitecore pulls the data when it needs to.

A sequence of batches

This is definitely not a real-time process. As you might imaging from the number of batches, pushes, and pulls shown in the diagram, it can take a significant amount of time to move an update — like a adding a new product to the catalog or setting up a new product attribute for a category — from D365 HQ into Sitecore XP. If every batch job involved executed on a 15 minute timer, it could take 45-60 minutes for that product to appear on the site. The interval could be longer depending on the size of the catalog and the number of changes made.

There’s more than one way to do it

Although Sitecore provides the catalog sync code as part of its commerce connectors, it’s really just an example or starter kit for us to use. In practice, you’ll need to modify the logic used to generate the catalog.xml file to import into Sitecore. You may also need to move the data sync process to other servers for scalability or performance reasons. Or you could replace Sitecore’s Data Exchange Framework with another ETL framework or a business process orchestration suite like BizTalk.

The connector is just a starting point for implementation, and hopefully the diagram and my explanation of it makes a good starting point for discussion with your team or client about how the process of syncing catalogs might work.

Sitecore Wildcard Items

November 27, 2017November 27, 2017 dthrasherLeave a comment

In a recent presentation to the DC Sitecore User Group, I was surprised to learn that most of the technical attendees didn’t know about Sitecore Wildcard Items. This hidden gem has been in Sitecore since at least Sitecore 4, and allows you to resolve item data however you like. For those of you with an ASP.NET MVC background, it’s like defining a route parameter at a particular URL segment. It’s an interesting example of what you can do with an httpRequestProcessor in Sitecore’s httpBeginRequestPipeline.

The original implementation was described by John West, former CTO of Sitecore, but his original blog post is lost to the Internet. You can find a discussion of Wildcard Items on page 39 of his book, Professional Sitecore Development.

Here’s how it works: Within the Sitecore content tree, you give an item a name of “*”. This item will act as a wildcard, matching any item at that level that doesn’t already have a sibling whose name explicitly matches that URL segment.

The beauty of this is that you can treat these items as regular Sitecore items — you can set presentation details, add them to workflow, etc. — but you can map the data to other Sitecore items or even an external database. For example, Sitecore uses wildcard items to resolve products within their Sitecore Commerce Reference Storefront implementation.

To implement a Wildcard Item, you’ll need to create two things:

A Sitecore Wildcard Item Resolver
An optional LinkProvider, so you can generate valid URLs for these items and reference them elsewhere in the site.

If you want to take a crack at implementing this yourself, have a look at Gaurav Agarwal’s post on Resolving the Wildcard Item, which has some code snippets to get you started.

There’s also an old Sitecore Wildcard Module, which does essentially the same thing, but uses the Sitecore Rules Engine to resolve the correct item from the Sitecore content tree. See Adam Conn’s post on Wildcards and Data-Driven URLs for details. It’s been available since 2011 with Sitecore 6, but could be modified to work in modern versions of Sitecore with a little work. I found a developer that created a revised wildcard module for Sitecore 7, for example.

Also, when using ASP.NET MVC, keep in mind that sometimes other pipelines might reset the Sitecore context item after your custom wildcard item resolver finishes its work. See Kamruz Jaman’s post on the Sitecore MVC Context Item for help troubleshooting this issue.

Wildcard items are a powerful technique, and can save a lot of hassles — I’ve seen many implementations “break out” of Sitecore to use MVC routing, then hack back in a Sitecore context or session object. But wildcard resolution keeps you within within the Sitecore stack and are a more natural approach to this problem. You can learn a lot about how Sitecore’s request handling pipelines operate by studying how it works and applying it in your own solutions.

Sitecore Commerce Catalogs at Scale

September 17, 2017 dthrasherLeave a comment

Last week, I gave a presentation to the DC Sitecore User Group on Sitecore Commerce Catalogs. It was a small crowd due to some thunderstorms in the area, and I had a tough act to follow. Phil Wickland, Sitecore MVP and author of several books on Sitecore, gave a talk on Personalization for Impact, which is worth seeing.

My talk was about how and why Sitecore imports catalog data from a PIM, using the Sitecore Commerce and Microsoft D365 integration as an example.

Here’s a link to the video:

The audio is a bit hard to hear at times, but I’ve posted my slides to slideshare here:

Sitecore Commerce Catalog Management at Scale from Dean Thrasher