Blog Archives

Desktop Automation Made Easy: A Winium, Java, and BDD Guide

by Yogesh Dhole | Apr 11, 2024 | Blog

Introduction:

Desktop application test automation can be a tedious task as it’s hard to locate the elements and interact with those elements. There are plenty of tools available for automating desktop applications. Winium is one of those tools which is a selenium-based tool. So for those who don’t have an idea about Selenium, Selenium is a web application test automation tool that supports almost all programming languages. (Wish to learn more about selenium? Check out the link here) If you are familiar with the Selenium tool then it’s going to be easy for you to understand the workings of the Winium tool as most of the methods are common and if you are not familiar with Selenium no worries, I have got you covered.

Coming back to our topic, In this blog we will see how we can create a robust test automation framework for automating desktop applications using Winium a desktop application automation tool, Java as a programming language, Maven, as a dependency management tool, Cucumber as a BDD (Behavior Driven Development) tool. We are going to build a test automation framework from scratch. Even if you don’t have any idea on how to create a framework no worries.

Before we start building the framework let’s complete the environment set-up. So for this, we will have to install some tools. Below I am sharing the URLs of the tools we are using just in case if you want to know more about these tools then you can visit these official web pages.

Winium Driver- https://github.com/2gis/Winium.Desktop
Cucumber – https://cucumber.io/
Java – https://www.java.com/en/
Maven – https://maven.apache.org/

Setting up the environment for Desktop Automation:

Java Set-up :

As we are using Java programming it is a must to have JDK installed on the system.
You can download the JDK (Make sure the version is greater than 8) For instance I have the Java 11.0.16.1 version set up on my system.
Use this link to download the Java SE Development Kit – https://www.oracle.com/in/java/technologies/javase/jdk11-archive-downloads.html
Once the download is completed the next step is setting up the path in the environment variables. Check the below screenshots to set up the path in your system environment variables

Once you are done with the above steps then you should see the below information in the command prompt.

Maven Set-up :

Once you are done with Java Installation and set up the next step is to do the installation and set up the maven.
To download the maven you can visit this official web page – https://maven.apache.org/download.cgi
Again after installation, we have to set up the environment variable path for our system.
See the below screenshots for your reference.

Intellij Idea IDE Set-up :

For this framework, I am using Intellij Idea IDE. You can download and set up IDE using the below link.
https://www.jetbrains.com/idea/download/?section=windows

Winium Driver Set-up :

To interact with the desktop elements we need a winium driver just like we need the browser drivers to automate the web pages.
To download the winium driver use this link – https://github.com/2gis/Winium.Desktop/releases

Inspect.Exe Set-up :

We need an inspector to inspect the desktop application elements. We use different approaches and tools to inspect and locate the web page elements.
Here we will use the Inspect.exe tool to inspect and locate the desktop application elements.
Use this link to download and install the tool – https://github.com/blackrosezy/gui-inspect-tool/blob/master/Inspect.exe
Not only that there are other desktop application element inspection tools.
Once you are done with the above steps then we can start building the automation framework.

Implementing BDD with Cucumber for Desktop Automation:

The BDD (Behavior-Driven-Development) is a software development approach that focuses on collaboration among stakeholders, including developers, QA engineers, and business analysts. The reason behind this is that in the BDD approach, we use natural language specifications to describe software behaviour from the end user’s perspective. I believe this helps in creating a shared understanding of requirements and promotes effective communication throughout the development lifecycle. Let’s see this in detail,

Feature File Creation :

Feature files are the main component of the BDD cucumber framework we can even say they are the heart of this cucumber framework.
These files are written using gherkin language which describes the high-level functionalities of the application.
Cucumber is a widely used BDD tool as it allows us to write test cases (scenarios) in plain tests using the Gherkin syntax.
This is because Gherkin uses keywords like, Given, When, And, and Then to structure scenarios, making it easy to read and understand by both technical and non-technical stakeholders.
Here is the one scenario that I have created for this framework.

@winiumApp
Feature: To verify the draw functionality of AutoCAD software
 As a User I want to launch the application
 and validate that I can access the different functionalities of the application.


 @smoke
 Scenario: Verify user can launch and open the new document using microsoft word application
   Given User launches the microsoft word application
   When User verifies the landing screen is visible with "Recent" opened document list
   And User clicks on "Blank document" option to add blank document
   Then User verifies that "Page 1 content" a new page for opened blank page is visible

Step Definition File Creation :

Yes, that’s correct. Step definition files contain code that maps the steps in the feature file to automation code.
These files are written using the programming language used in the automation framework, in this case, Java.
The step definitions are responsible for interacting with the elements of the application and performing actions on them such as clicking, entering text, etc.
They also contain assertions to check if the expected behaviour is observed in the application.

package com.SpurCumber.Steps;
 
import com.SpurCumber.Pages.DemoWiniumAppPage;
import com.SpurCumber.Utils.ScreenshotHelper;
import com.SpurCumber.Utils.TestContext;
import io.cucumber.java.en.And;
import io.cucumber.java.en.Given;
import io.cucumber.java.en.Then;
import io.cucumber.java.en.When;
import org.testng.Assert;
 
 
public class DemoWiniumAppSteps extends TestContext {
   private final DemoWiniumAppPage demoWiniumAppPage;
 
   public DemoWiniumAppSteps() {
       demoWiniumAppPage = new DemoWiniumAppPage(winiumdriver);
   }
   @Given("User launches the microsoft word application")
   public void userLaunchesTheMicrosoftWordApplication() {
       scenario.log("The application is launched successfully!");
       ScreenshotHelper.takeWebScreenshotBase64(winiumdriver);
       ScreenshotHelper.captureScreenshotAllure(winiumdriver, "User launches the microsoft word application");
   }
 
   @When("User verifies the landing screen is visible with {string} opened document list")
   public void userVerifiesTheLandingScreenIsVisible(String arg0) throws InterruptedException {
       Assert.assertTrue(demoWiniumAppPage.verifyScreen(arg0));
       ScreenshotHelper.takeWebScreenshotBase64(winiumdriver);
       ScreenshotHelper.captureScreenshotAllure(winiumdriver, "User verifies the landing screen is visible with "+arg0+" opened document list");
   }
 
   @And("User clicks on {string} option to add blank document")
   public void userClicksOnOptionToAddBlankDocument(String arg0) throws InterruptedException {
       demoWiniumAppPage.clickBtnByName(arg0);
       ScreenshotHelper.takeWebScreenshotBase64(winiumdriver);
       ScreenshotHelper.captureScreenshotAllure(winiumdriver, "User clicks on "+arg0+" option to add blank document");
   }
 
   @Then("User verifies that {string} a new page for opened blank page is visible")
   public void userVerifiesThatANewPageForOpenedBlankPageIsVisible(String arg0) throws InterruptedException {
       Assert.assertTrue(demoWiniumAppPage.verifyScreen(arg0));
       ScreenshotHelper.takeWebScreenshotBase64(winiumdriver);
       ScreenshotHelper.captureScreenshotAllure(winiumdriver, "User verifies that "+arg0+" a new page for opened blank page is visible");
   }
}

Hooks File Creation :

In Cucumber, hooks are methods annotated with @Before and @After that run before and after each scenario.
To ensure consistency between test environments, these hooks are used for setting up and taking down tests.
The application can be initialized before and cleaned up after each scenario using hooks, for example.

package com.SpurCumber.Steps;
import com.SpurCumber.Utils.TestContext;
import com.SpurCumber.Utils.WiniumUtil;
import io.cucumber.java.After;
import io.cucumber.java.Before;
import io.cucumber.java.Scenario;
import java.io.IOException;
 
public class BaseSteps extends TestContext {
   @Before("@winiumApp")
   public void InitializeWiniumApp(Scenario scenario1) throws IOException, InterruptedException {
       scenario = scenario1;
       WiniumUtil.WiniumServerInit();
       Thread.sleep(3000);
       winiumdriver = WiniumUtil.WiniumIniit();
   }
 
   @After("@winiumApp")
   public void TearDownWiniumApp() {
       WiniumUtil.stopWiniumDriver();
       WiniumUtil.WiniumTearDown();
   }
}

Implementing Page Object Model (POM) for Desktop Automation:

The Page Object Model (POM) is a design pattern that assists in building automation frameworks that are scalable and maintainable. In POM, we create individual page classes for each application page or component, which encapsulates the interactions and elements associated with that particular page. This approach improves code readability, reduces code duplication, and enhances test maintenance.

package com.SpurCumber.Pages;
 
import org.openqa.selenium.WebElement;
import org.openqa.selenium.winium.WiniumDriver;
 
public class DemoWiniumAppPage {
 
   private final WiniumDriver winiumdriver;
 
   public DemoWiniumAppPage(WiniumDriver _winiumdriver) {
       this.winiumdriver = _winiumdriver;
   }
 
   public Boolean verifyScreen(String locator) throws InterruptedException {
       WebElement Screen = winiumdriver.findElementByName(locator);
       return Screen.isDisplayed();
   }
 
   public void clickBtnByName(String locator) throws InterruptedException {
       WebElement element = winiumdriver.findElementByName(locator);
       Thread.sleep(3000);
       element.click();
   }
}

Creating Utility Files to Support Framework:

In a test automation framework, utility files provide reusable functionalities, configurations, and helper methods to streamline the development, execution, and maintenance of test scripts. As a result, they enhance the efficiency, scalability, and maintainability of the automation framework. Listed below are a few common utility files, along with their functions:

Winium Util File :

This utility file handles the launch and termination processes of the desktop application, as well as the Winium driver
When we use Winium as a desktop application automation tool we have to start the server. (Winium Driver).
Either we can do this manually before starting the execution of the test case or we can do this through automation as well.
In the below utility file there are methods created for launching the desktop application and Winium driver (server).

package com.SpurCumber.Utils;
 
import org.openqa.selenium.winium.DesktopOptions;
import org.openqa.selenium.winium.WiniumDriver;
import java.net.URL;
public class WiniumUtil {
 
   private static WiniumDriver winiumdriver = null;
   private static Process winiumDriverProcess;
 
   public static WiniumDriver WiniumIniit() {
       try {
           DesktopOptions option = new DesktopOptions();
           option.setApplicationPath("C:\\ProgramData\\Microsoft\\Windows\\Start Menu\\Programs\\Word.lnk");
           winiumdriver = new WiniumDriver(new URL("http://localhost:9999"), option);
       } catch (Exception e) {
           e.printStackTrace();
       }
       return winiumdriver;
   }
 
   public static void WiniumTearDown() {
       if (winiumdriver != null) {
           winiumdriver.quit();
       }
       winiumdriver = null;
   }
 
   public static void WiniumServerInit() {
       try {
           String driverpath = CommonUtil.getResourceDirPath("Winium.Desktop.Driver.exe");
           String command = driverpath + " -p " + "9999";
           winiumDriverProcess = new ProcessBuilder("cmd.exe", "/c", command).start();
           Thread.sleep(5000);
       } catch (Exception e) {
           e.printStackTrace();
       }
   }
 
   public static void stopWiniumDriver() {
       try {
           winiumDriverProcess.destroy();
           Thread.sleep(3000);
       } catch (Exception e) {
           e.printStackTrace();
       }
   }
}

Common Util File :

This common util file reads or retrieves the values and files present in a particular folder (referenced here as the resource folder).
This file can further serve as a basis for developing additional common methods usable throughout the framework.

package com.SpurCumber.Utils;
 
import java.io.File;
import java.nio.file.Paths;
 
public class CommonUtil {
 
   public static String getResourceDirPath(String parameter) {
       String assemblyLocation = System.getProperty("user.dir");
       String path = Paths.get(assemblyLocation+"/src/test/resources/"+parameter).toString();
       return new File(path).getAbsolutePath();
   }
}

Test Runner File :

The TestRunner class executes Cucumber tests with specified configuration settings, including the location of feature files, step definitions package, inclusion tags, and report generation plugins.
The seamless integration of Cucumber tests into TestNG makes testing and reporting easy.

package com.SpurCumber.Utils;
 
import io.cucumber.testng.AbstractTestNGCucumberTests;
import io.cucumber.testng.CucumberOptions;
import org.testng.annotations.DataProvider;
 
 
@CucumberOptions(features = {"src/test/java/com/SpurCumber/Features"},
       glue = ("com.SpurCumber.Steps"),tags="@winiumApp",
       plugin = { "pretty","io.qameta.allure.cucumber7jvm.AllureCucumber7Jvm","pretty", "json:test-output/Cucumber.json","html:STDOUT","html:test-output/Cucumber.html" })
public class TestRunner extends AbstractTestNGCucumberTests {
 
   @DataProvider
   @Override
   public Object[][] scenarios() {
       return super.scenarios();
   }
}

Executing Tests:

Once we have defined the test scenarios, we will use Maven commands to execute them. Maven is a robust tool that manages project dependencies and automates the build process. With Maven, we can run automated tests with ease and ensure a smooth and efficient testing process.

Configuring Maven POM File(Pom.xml):

In the project’s Maven Project Object Model (POM) file, we define the necessary configurations for test execution.
This includes specifying the test runner class, defining the location of feature files and step definitions, setting up plugins for generating test reports, and configuring any additional dependencies required for testing.

<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
   <modelVersion>4.0.0</modelVersion>
   <groupId>com.SpurQLabs</groupId>
   <artifactId>SpurCumber</artifactId>
   <version>1.0-SNAPSHOT</version>
   <properties>
       <maven.compiler.source>11</maven.compiler.source>
       <maven.compiler.target>11</maven.compiler.target>
   </properties>
   <dependencies>
       <dependency>
           <groupId>io.cucumber</groupId>
           <artifactId>cucumber-java</artifactId>
           <version>7.12.0</version>
       </dependency>
       <dependency>
           <groupId>io.cucumber</groupId>
           <artifactId>cucumber-core</artifactId>
           <version>7.11.1</version>
       </dependency>
       <dependency>
           <groupId>io.cucumber</groupId>
           <artifactId>cucumber-testng</artifactId>
           <version>7.11.1</version>
       </dependency>
       <dependency>
           <groupId>io.cucumber</groupId>
           <artifactId>cucumber-picocontainer</artifactId>
           <version>7.11.1</version>
           <scope>test</scope>
       </dependency>
       <dependency>
           <groupId>io.rest-assured</groupId>
           <artifactId>json-path</artifactId>
           <version>5.3.0</version>
           <scope>test</scope>
       </dependency>
       <dependency>
           <groupId>io.rest-assured</groupId>
           <artifactId>json-schema-validator</artifactId>
           <version>5.3.0</version>
           <scope>test</scope>
       </dependency>
       <dependency>
           <groupId>org.testng</groupId>
           <artifactId>testng</artifactId>
           <version>7.7.1</version>
           <scope>test</scope>
       </dependency>
       <dependency>
           <groupId>io.qameta.allure</groupId>
           <artifactId>allure-cucumber7-jvm</artifactId>
           <version>2.24.0</version>
           <scope>test</scope>
       </dependency>
       <!-- https://mvnrepository.com/artifact/io.qameta.allure/allure-java-commons -->
       <dependency>
           <groupId>io.qameta.allure</groupId>
           <artifactId>allure-java-commons</artifactId>
           <version>2.21.0</version>
       </dependency>
       <dependency>
           <groupId>org.seleniumhq.selenium</groupId>
           <artifactId>selenium-java</artifactId>
           <version>3.141.59</version>
           <scope>test</scope>
       </dependency>
       <!-- https://mvnrepository.com/artifact/com.github.2gis.winium/winium-webdriver -->
       <dependency>
           <groupId>com.github.2gis.winium</groupId>
           <artifactId>winium-webdriver</artifactId>
           <version>0.1.0-1</version>
       </dependency>
 
   </dependencies>
   <build>
       <plugins>
           <plugin>
               <groupId>org.apache.maven.plugins</groupId>
               <artifactId>maven-compiler-plugin</artifactId>
               <version>3.8.1</version>
               <configuration>
                   <source>11</source>
                   <target>11</target>
               </configuration>
           </plugin>
           </plugins>
 
 
   </build>
   <repositories>
       <repository>
           <id>clojars</id>
           <name>Clojars</name>
           <url>https://repo.clojars.org/</url>
       </repository>
   </repositories>
</project>

Running Tests with Maven Commands:

Once you configure the automated tests in the Maven POM file, you can run them using Maven commands from the terminal or command prompt. Common Maven commands used for test execution include:

mvn test – This command runs all the tests from the project.
mvn clean test – This command first cleans the project (removes the target directory) and then runs the tests.
mvn test “-Dcucumber.filter.tags=@tagName” – This command runs tests with specific Cucumber tags.

Generating Cucumber Reports:

Cucumber provides built-in support for generating comprehensive test reports. By configuring plugins in our automation framework, we can generate detailed reports that showcase the test results, including passed, failed, and pending scenarios. These reports offer valuable insights into the test execution, helping us identify issues, track progress, and make data-driven decisions for test improvements.

Conclusion:

Automating desktop applications with Winium, Java, and Behavior-Driven Development (BDD) using Cucumber is a strategic approach that offers numerous benefits to software development and testing teams. By combining these technologies and methodologies, we create a robust automation framework that enhances software quality, reduces manual efforts, and promotes collaboration across teams.

In conclusion, automating desktop applications with Winium, Java, and BDD using Cucumber empowers teams to deliver high-quality software efficiently. By leveraging the strengths of each technology and following best practices such as the Page Object Model and Maven integration, we create a solid foundation for successful test automation that aligns with business goals and enhances overall product quality.

Source Code:

You can access the complete source code of the created automation framework for desktop applications using Winium, Java, and BDD with Cucumber on GitHub at https://github.com/spurqlabs/Desktop-App-Winium-Java-Cucumber The framework includes feature files, step definitions, page classes following the Page Object Model, Maven dependencies, and configuration files for generating Cucumber reports. Feel free to explore, fork, and contribute to enhance the framework further.

Desktop Automation Switching Between the Windows

by Kishori Fale | Mar 1, 2024 | Blog

While automating web applications or desktop interfaces, we often encounter instances where multiple windows have to be managed simultaneously within an application. Such scenarios may demand a method to navigate seamlessly between these windows and execute the necessary actions. Usually, it’s tempting to perform actions on all windows simultaneously. However, the reality is that such simultaneous actions are not always feasible. Hence, we must employ techniques to switch between windows efficiently using Desktop Automation, allowing us to execute the required tasks effectively. This requires us to develop strategies for smoothly transitioning between windows as needed, ensuring that we carry out every action precisely where it’s needed most.

During a test case execution, it’s not uncommon for multiple windows to come into play. That’s where window handling steps in – it’s the savvy technique that ensures smooth management of these windows. Picture this: as new windows pop up during execution, we seamlessly shift the driver’s focus to the newly opened window. This pivotal maneuver grants us the control we need to execute subsequent actions with precision. Whether we’re testing desktop automation or interacting with applications, mastering window handling is paramount. It’s the key that unlocks the door to seamless, effective testing procedures.

In desktop automation applications, a window can be either the pop-up window coming within the application or a new application itself. Unlike WebDriver, winAppDriver does not provide any built-in methods to handle the multiple open windows within the application or other Windows applications. In desktop applications automation it is a task of testers to write their methods to perform switching between multiple windows as per the requirement of the automation.

As testers, navigating between actions is an inevitable part of our role. These transitions are not just incidental; they’re essential to the testing process, allowing us to scrutinize the application’s behavior comprehensively. Picture this: sometimes, we find ourselves toggling between pop-up windows within the application, or perhaps switching to an entirely new application altogether. It’s all part of the meticulous process to evaluate how the application under test behaves in various scenarios. Now, when it comes to desktop automation, there’s an array of techniques at our disposal for seamless switching methods, each offering its own unique approach like

switching the window with the window name
switching the window with the Class name
switching the window with the window name and window class name(combined)
switching the window with the current date etc.

In this blog, we will explore how we can switch between the two desktop applications through automation.

In the following example, I will explain how we can switch between a calculator and an Excel workbook with the name of the application.

Let’s embark on a journey with our trusty companion, an Excel workbook named “InputFile,” adorned with four columns: Operand1, Operand2, Operator, and Result. While Operand1, Operand2, and Operator columns are already graced with some sample data, the Result column eagerly awaits its destiny. Our mission? To extract data from this Excel sanctuary, employ it for calculations within our calculator, and then gracefully usher the results back into the workbook.We have three ways to perform this task:

Launch both the applications within the scenarios one after another and do the switching between them.
Keep one application open before the execution of the scenario launch the second application and do the switching between them.
Keep both applications open and do the switching between them.

In this blog, we will follow the second way to explain the example of switching. So let us follow these steps:

Launch the desktop calculator through automation.
Switch to the Excel workbook which we will keep open before the start of the execution and will read the operands and operators.
Then switch back to the calculator and do the calculations as per data from the Excel workbook, then read the result and save that result in the workbook after switching on it.

Launch Desktop Application

public static WindowsDriver winAppInit() {
   try {
       if (!serverListening("localhost", 4723))
           Runtime.getRuntime().exec("cmd /c start cmd.exe /K " + ConfigUtil.getPropValues
                   ("WinAppDriverLocation"));
       sleep(2000);
       DesiredCapabilities capabilities = new DesiredCapabilities();
       capabilities.setCapability("app", "Microsoft.WindowsCalculator_8wekyb3d8bbwe!App");
       capabilities.setCapability("platformName", "Windows 10");
       capabilities.setCapability("ms:experimental-webdriver", true);
       winAppDriver = new WindowsDriver(new URL("http://127.0.0.1:4723/"), capabilities);
       winAppDriver.manage().timeouts().implicitlyWait(2, TimeUnit.SECONDS);
   } catch (Exception e) {
       e.printStackTrace();
   } finally {
   }
   return winAppDriver;
}

The above method is to launch any desktop application using winAppDriver selenium.

Windows application driver is listening for requests at http://127.0.0.1:4273/. So, To establish a connection with a computer we use “localhost” as the default name and a “4273” port for communication between the automation framework and the target desktop application.

Let’s split the above method into 3 parts

Part 1:

Check if a server is listening on localhost specifically on port 4723 using the “serverListening” function. If the server is not listening, it executes a command using Runtime.getRuntime().exec() to open the WinAppDriver executable.

Part 2:

Creates a new instance of DesiredCapabilities, which is a class used to specify the desired capabilities or configuration for the automation session. It sets the capability “app” to the value “Microsoft.WindowsCalculator_8wekyb3d8bbwe!App”. This indicates that we are automating the Windows Calculator application. The value represents the application’s package name and the entry point or application identifier. It also sets the capability “platform name” to the value “Windows 10”. It specifies the target platform for the automation session, indicating that it should run on a Windows 10 machine. Apart from that, It sets capability “ms:experimental-webdriver” to true. This capability enables experimental features in the WinAppDriver.

Part 3:

Create a new instance of WindowsDriver using the URL “http://127.0.0.1:4723/” (localhost:4723) as the server URL and the previously configured capabilities. This establishes a connection to the WinAppDriver server and initializes the WindowsDriver instance for automating the Windows Calculator application. Finally, the method returns the initialized WindowsDriver object.

Switch to the Window Name Method in Desktop Automation

public static WindowsDriver switchToWindowWithName(String name) throws MalformedURLException,
       InterruptedException {
   DesiredCapabilities capabilities = new DesiredCapabilities();
   capabilities.setCapability("app", "Root");
   capabilities.setCapability("platformName", "Windows");
   capabilities.setCapability("ms:experimental-webdriver", true);
   rootDriver = new WindowsDriver(new URL("http://127.0.0.1:4723"), capabilities);
   sleep(3);
   List<WebElement> windows = rootDriver.findElementsByXPath("/Pane/Window");
   for (WebElement window : windows) {
       String windowName = window.getAttribute("Name");
         if (windowName.contains(name)) {
           sleep(100);
           String hex = window.getAttribute("NativeWindowHandle");
           int handle = Integer.parseInt(hex);
           String windowHandle = Integer.toHexString(handle);
           windowHandle = "0x" + windowHandle;
           DesiredCapabilities caps = new DesiredCapabilities();
           caps.setCapability("appTopLevelWindow", windowHandle);
           caps.setCapability("platformName", "Windows");
           caps.setCapability("ms:experimental-webdriver", true);
           winAppDriver = new WindowsDriver(new URL("http://127.0.0.1:4723"), caps);
           // todo change the time
        winAppDriver.manage().timeouts().implicitlyWait(40, TimeUnit.SECONDS);
           System.out.println("Switched to window with name 1: " + winAppDriver.getTitle());
           return winAppDriver;
       }
   }
   return null;
}

This is the method that switches the control to another application window. This method takes the name of the window as a parameter and returns a WindowsDriver object. Before getting into this method let me discuss the “root” window in desktop applications.

Here we will see the “root” window: In any window system every window is part of some other window, we call it the parent window. This makes a kind of hierarchy of the windows. And there is one window we can call the “root” window of this hierarchy. The “root” window is the screen surface, and other windows are children of it. This can be more clear with the two figures. The direct children of the root window are called top-level windows.

So the above Fig: 1 is the root window, which covers the whole screen; 2,3, and 4 are top-level windows; 5 and 6 are subwindows of 2.

Below Fig 2: Hierarchy of the windows: calculator and Excel file are the children of the root window.

Now break down the method into 3 parts.

Part 1:

It sets up the desired capabilities for Windows-based application automation. It creates a new instance of DesiredCapabilities, which is used to specify the desired capabilities or properties for the automation session. Here, the “app” capability is set to “root”. It also sets the “platform name” capability to “Windows”, indicating that the target platform for the automation is Windows. Finally, a new instance of WindowsDriver referred to as rootDriver, is created with the desired capabilities. A URL pointing to the server running locally on http://127.0.0.1:4723. We will use this driver instance to interact with the Windows application specified by the “app” capability.

Part 2:

It finds all the windows in the application by using an XPath query (/Pane/Window). It iterates over each window and retrieves its name using the get Attribute method. If the window name contains the specified name parameter, it proceeds with the following steps.

The method retrieves the native window handle (Handles are numbers that uniquely identify each window.) of the matching window as a hexadecimal string using the getAttribute method
It converts it to an integer.
The window handle is then converted back to a hexadecimal string and prefixed with “0x” to create a valid window handle.

Part 3:

A new set of DesiredCapabilities is generated. Configuring the “appTopLevelWindow” capability with the window handle value obtained in the preceding step. This set, which includes the acquired window handle and other essential capabilities, is then used to initialize a new WindowsDriver object named ‘winAppDriver.’ The initialization requires providing the URL of the WinAppDriver server and updating the capabilities. The resulting ‘winAppDriver’ now has control over the newly opened window.

Interaction with calculator & Excel workbook

We can inspect all the locators of the calculator like all input buttons, results in text boxes, etc. In Fig we can see the locators of the calculator, for example for button nine the locator is “Nine” and to read the result locator is “CalculatorResult” etc. To enter any number in the calculator I search for an element by name and reading the result from the calculator I am using an automation ID.

In the same way, we can locate all cells on the Excel workbook once we switch on it. And using those locators we can read operands and operators from Excel and use them whenever needed. In the figure, we can see that every cell in the sheet has its unique locator like “A2”, “ B2” etc.

Explore the Desktop Automation example using a Feature, Step, and Page File.

Feature File:

Scenario: Switching between calculator and excel workbook
Given I am on Calculator Windows App
When  I switch to 'InputFile' excel workbook
And   I read first number from sheet
And
   I read operator from the sheet
And
   I read second number from sheet
And
   I switch to calculator
And
   I do calculation as per the data which I read from excel
And   I
 switch to 'InputFile' excel workbook
Then  I save the result in excel workbook

I kept the feature file very simple. First of all, I am launching the calculator. Then I switched to “InputFIle” which is an Excel workbook that I kept open before the start of execution. Once I have control of the Excel workbook I can read the first row of data using the “name” locator. After that, I switch to a calculator, perform calculations on data read the output and again switch back to the Excel workbook. Following that, I save the result to the appropriate cell. Basically, I switch to an Excel workbook in which I have to pass the name of the file as a parameter. The following is to switch to a calculator I use “Calculator” as a parameter directly.

Step File:

Now in the step file, We can see in the following step that I called the method and passed the window name as a parameter that I have written in the above feature file. For the calculator, I have used a separate method.

@When("I switch to {string} excel workbook")
public void iSwitchToInputFileExcelWorkbook(String fileName) throws MalformedURLException, InterruptedException { switchToWindowPage.switchToOtherWindowApplication (fileName);
}
@And("I read first number from sheet")
public void iReadFirstNumberFromSheet() { switchToWindowPage.readOperand1(); }
@And("I switch to calculator")
public void iSwitchToCalculator() throws MalformedURLException, InterruptedException {
switchToWindowPage.switchToCalculator();
}

Page File:

These are the two methods that I have used to call the “switchToWindowWithName” method which I have in WinAppUtil

public void switchTootherWindowApplication(String fileName) throws MalformedURLException, InterruptedException {
sleep( millis: 2000);
winAppDriver WinAppUtil.switchToWindowWithName(fileName);
sleep( millis: 100);
}
public void switchToCalculator () throws MalformedURLException, InterruptedException {
winAppDriver=WinAppUtil.switchToWindowWithName("Calculator");
}

You can find the complete code at htps://github.com/kfale-spurqlabs/Desktop-Application-Testing-Switching-Between-The-Windows

Conclusion:

We all have been using desktop automation applications long before the web and mobile apps but testing these apps becomes very rare. Communities are not large enough, so it is hard to find solutions for our issues in automation. So, in this blog, I have tried to cover all the scenarios where a desktop automation applications tester needs to perform switching between windows or within the window. Hope you will find it useful. For me, desktop applications have been a nightmare earlier but when I started working with them they seem like a piece of cake now! Hope we all can overcome all of the challenges in desktop automation testing with good preparation, strategy, and the team’s test automation skills.

Cypress Testing Best Practices and Tips for Assertions Techniques

by Harishchandra Ekal | Feb 22, 2024 | Blog

“Cypress Testing – Assertions Techniques Best Practices and Tips” focuses on enhancing the efficiency and effectiveness of test assertions in Cypress, a popular JavaScript end-to-end testing framework.

Cypress testing plays a crucial role in ensuring the reliability and correctness of web application tests. Developers and testers use these to validate expected outcomes, allowing them to assert conditions about the application’s state during test execution. Automation Testing.

We can summarise the key features of Assertions in Cypress Testing as:

Rich Assertions: Comprehensive checks for element properties (existence, visibility, text content, attributes).
Seamless Integration: Assertions smoothly blend into test syntax, improving readability and maintenance.
Automatic Retry: Robust handling of asynchronous tasks, minimizing test flakiness.
Expressive Tests: Empowers developers to create clear, comprehensive, and efficient tests, boosting confidence in the testing process.

Assertions in Cypress Testing:

Verify that an element exists in the DOM:

Syntax: .should(‘exist’)

Example:

<body> 
    <button id="myButton">Click me</button> 
</body> 
cy.get('button#myButton').should('exist');

Verify that an element does not exist in the DOM:

Syntax: .should(‘ not.exist ‘)

Example:

<body> 
    <button id="myButton">Click me</button> 
</body> 
cy.get('button#myButton').should(‘not.exist’)

Verify that an element is visible/Not Visible:

Syntax: .should(‘be.visible ‘) .should(‘not.be.visible ‘)

Example:

<button id="myButton">Visible Button</button> 
<button id="hiddenButton">Hidden Button</button> 
 
//To check visibility of element 
cy.get(' button#myButton ').should('be.visible') 
//To check invisibility of element   
cy.get('button#hiddenButton').should('not.be.visible');

Verify that an element is hidden:

Syntax: .should(‘be.hidden)

Example:

<body> 
    <div id="hiddenElement">This element is hidden</div> 
</body> 
cy.get('#hiddenElement').should('be.hidden');

Verify that an element has the expected value that the user has entered in the textbox:

Syntax: .should(‘have.value’, ‘expectedValue’)

Example:

<body> 
    <input type="text" id="myTextbox" /> 
</body> 
cy.get('#myTextbox').type("Hello SpurQLabs”); 
cy.get('#myTextbox').should('have.value', 'Hello SpurQLabs');

Verify that a string includes the expected substring:

Syntax: .should(‘include’, ‘expectedSubstring’)

Example:

<title>Cypress Example</title> 
    <body> 
        <div id="myText">This is some text content.</div> 
    </body> 
// Verify that the text content includes the expected substring 
const expectedSubstring = 'some text'; 
cy.get('#myText').should('include.text', expectedSubstring);

Verify that a string matches a regular expression pattern:

Syntax: .should(‘match’, /regexPattern/)

Example:

<title>Cypress Example</title> 
    <body> 
        <div id="myText">This is some text content.</div> 
    </body> 
// Verify that the text content matches a regular expression pattern 
const regexPattern = /some.*content/; 
cy.get('#myText').invoke('text').should('match', regexPattern);

Verify the length of an array or the number of elements matched:

Syntax: .should(‘have.length’, expectedLength)

Example:

<title>Cypress Example</title> 
    <body> 
        <ul id="myList"> 
            <li>Item 1</li> 
            <li>Item 2</li> 
            <li>Item 3</li> 
        </ul> 
    </body> 
const expectedLength = 3; 
cy.get('#myList li').should('have.length', expectedLength);

Verify if the element is focused:

Syntax: .should(‘have.focus’)
.should(‘be.focused’)

Example:

<title>Cypress Example</title> 
    <body> 
        <input type="text" id="myInput" /> 
    </body> 
// Focus on the input element 
   cy.get('#myInput').focus();  
// Verify that the input element is focused 
   cy.get('#myInput').should('have.focus'); 
   cy.get('#myInput').should('be.focused ');

Verify the title of the page:

Syntax: .title().should(‘include’, ‘Example Domain’)

Example:

<title>Cypress Example</title> 
const expectedTitle = 'Cypress Example'; 
cy.title().should('eq', expectedTitle); 
cy.title().should('include', expectedTitle )

Verify the URL:

Syntax: .url().should(‘eq’, ‘https://www.spurqlabs.com’);

Example:

cy.visit('https://www.spurqlabs.com'); 
const expectedURL = 'https://www.spurqlabs.com'; 
cy.url().should('include', expectedURL);

Verify multiple assertions at a time:

Example:

cy.get('h1') 
.should('exist')  // Assertion 1: Check if the h1 element exists 
.and('be.visible') // Assertion 2: Check if the h1 element is visible 
.and('have.text', 'Example Domain'); // Assertion 3: Check if the h1 element has the expected text

Property Assertion in Cypress Testing

Verify that an element has the expected attribute value:

Syntax: .should(‘have.attr’, ‘attributeName’, ‘expectedValue’)

Example:

<title>Cypress Example</title> 
<body> 
      <div id="myElement" data-custom-attribute="example-value">Some content</div> 
</body> 
const attributeName = 'data-custom-attribute'; 
const expectedValue = 'example-value'; 
cy.get('#myElement').should('have.attr', attributeName, expectedValue); 
});

Verify that an element has a specific CSS property with the expected value:

Syntax: .should(‘have.css’, propertyName, Value)

Example:

<title>Cypress Example</title> 
    <style> 
        #myElement { 
            color: blue; 
            font-size: 16px; 
        } 
    </style> 
    <body> 
        <div id="myElement">Styled element</div> 
    </body> 
const propertyName = 'color'; 
const expectedValue = 'blue'; 
cy.get('#myElement').should('have.css', propertyName, expectedValue);

Verify that an element has the expected text content:

Syntax: .should(‘have.text’, expectedText)

Example:

<title>Cypress Example</title> 
    <body> 
        <div id="myElement"> SpurQLabs </div> 
    </body> 
const expectedText = 'SpurQLabs'; 
cy.get('#myElement').should('have.text', expectedText);

Verify that an input element has the expected value:

Syntax: .should(‘have.value’, expectedValue )

Example:

<title>Cypress Example</title> 
    <body> 
        <input type="text" id="myInput" value="SpurQLabs" /> 
    </body> 
const expectedValue = 'SpurQLabs'; 
cy.get('#myInput').should('have.value', expectedValue);

Verify that a given value is NaN, or “not a number”:

Syntax: .should(‘be.a.NaN’)

Example:

// Some operation that results in NaN, for example, dividing by zero  
const result = 1 / 0; 
// Verify that the result is NaN 
1) cy.wrap(result).should('be.a.NaN'); 
2) cy.wrap(result).then(value => { 
     expect(value).to.not.be.NaN; 
   });

Verify an element or collection of elements is empty:

Syntax: .should(‘be.empty’)
.should(‘not.be.empty’)

Example:

<body> 
        <div id="emptyElement"></div> 
        <div id="nonEmptyElement">Some content</div> 
</body> 
// Verify that the empty element is empty 
cy.get('#emptyElement').should('be.empty'); 
 
// Verify that the non-empty element is not empty 
cy.get('#nonEmptyElement').should('not.be.empty');

Verify if a checkbox or radio button is checked:

Syntax: .should(‘be.checked’)

Example:

<body> 
      <input type="checkbox" id="myCheckbox" checked /> 
      <input type="radio" id="myRadioButton" checked /> 
</body> 
cy.get('#myCheckbox').should('be.checked'); 
cy.get('#myRadioButton').should('be.checked');

Verify if a checkbox or radio button is not checked:

Syntax: .should(‘not.be.checked’)

Example:

cy.get('#myCheckbox').should('not.be.checked'); 
cy.get('#myRadioButton').should('not.be.checked');

Verify if it is an array:

Syntax: .should(‘be.an’,’array’)

Example:

const myArray = [1, 2, 3]; 
cy.wrap(myArray).should(value => { 
   expect(value).to.be.an('array'); 
  }); 
cy.get('myArray’).should(‘be.an’,’array’)

Verify if an object has specific keys:

Syntax: .should(‘have.keys’,[‘id’,’name’,’email’])

Example:

const myObject = { 
            key1: 'value1', 
            key2: 'value2', 
            key3: 'value3' 
        }; 
// Define the expected keys 
const expectedKeys = ['key1', 'key2', 'key3']; 
 
 // Verify that the object has the expected keys 
  cy.wrap(myObject).should(obj => { 
    expect(Object.keys(obj)).to.deep.equal(expectedKeys); 
   }); 
cy.get('#object’).should('have.keys', ['id', 'name', 'email'])

Verify if a value is one of a specific set of values:

Syntax: .should(‘be.oneOf’,[‘value1′,’value2′,’value3’])

Example:

cy.get(''YourCSSelement ').should('be.oneOf', ['red', 'green', 'blue'])

Verify that a numeric value is within a certain range of another value:

Syntax: .should(‘be.closeTo’, expectedValue, delta))
.should(‘be.within’, Start range, End range);

Example:

const actualValue = 15; 
const referenceValue = 10; 
const range = 5; 
 
// Verify that the actual value is within the specified range of the reference value 
cy.wrap(actualValue).should('be.within', referenceValue - range, referenceValue + range); 
    }); 
// Verify that the actual value is close to the expected value within the specified delta 
cy.wrap(actualValue).should('be.closeTo', 50, 2); // Verify that actual value is close to (50-2) to (50+2) i.e. 48 to 52.

Verify Object assertion:

Syntax: .should(‘have.property’, ‘propertyName’,’actualPropertyValue’)

Example:

const expectedObject = { 
name: 'Tom Cruise’, 
age: 32, 
city: ‘London’ 
} 

cy.wrap(expectedObject) 
.should('have.property', 'name', expectedObject.name) 
.should('have.property', 'age', expectedObject.age) 
should('have.property', 'city', expectedObject.city)

Check is() block Assertions:

In the context of Cypress Testing, the .is() block typically utilizes conditions that check various states or attributes of an element. Here are some examples of selectors and conditions you might use inside the .is() block:

Check if an element is visible:

if($element.is(':visible')){ 
// Code to execute when the element is visible 
}

Check if a button or input is enabled:

if ($element.is(':enabled')) { 
// Code to execute when the element is enabled 
}

Check if an input field is readonly:

if ($element.is('[readonly]')) { 
// Code to execute when the element is readonly 
}

Check if an element contains specific text:

if ($element.is(':contains("Some Text")')) { 
// Code to execute when the element contains the specified text 
}

Check if an element has a specific attribute value:

if ($element.is('[data-type="value"]')) { 
// Code to execute when the element has the specified attribute value 
}

Create custom conditions based on your specific requirements:

if ($element.is('.custom-class')) { 
// Code to execute when the element has a specific class 
}

Conclusion:

Cypress Testing with its rich set of functionalities and integration benefits, empowers developers to create expressive and comprehensive tests. The combination of these features fosters a more efficient and confident testing process, ultimately contributing to the overall reliability of web applications.

Harishchandra Ekal

Harish is an SDET with expertise in API, web, and mobile testing. He has worked on multiple Web and mobile automation tools including Cypress with JavaScript, Appium, and Selenium with Python and Java. He is very keen to learn new Technologies and Tools for test automation. His latest stint was in TestProject.io. He loves to read books when he has spare time.

How to Automate Chrome Extension using selenium?

by Sumit Kunjir | Feb 21, 2024 | Blog

Introduction to Automate Chrome Extension:

Over the years, the landscape of software testing has gradually developed from a predominantly manual testing phase to an increasing accentuation on automated/automation testing. In your career path as a test engineer, you will inevitably bump into automation testing. In the current landscape of the software industry, clients seek frequent and repetitive deployments. If you are in a role of Quality Assurance, you are likely to encounter and test systems needing frequent requirement changes or the rapid introduction of new and progressive requirements. Such a dynamic landscape calls for a constant adaptation to frequent code changes within stiff deadlines. A challenge that we can effectively address by adopting automation testing methodologies.

Why to Automate Chrome Extension:

We often use Chrome extensions in our daily activities, which is crucial for enhancing productivity. The repetitive nature of certain tasks associated with these extensions can become monotonous over time. This blog aims to guide you through the process of automating Chrome extensions and executing click actions using Selenium, a widely acclaimed open-source test automation framework introduced in 2004. If you find yourself needing to use a particular extension regularly, the conventional method involves manually adding the extension to Chrome and performing the same task repeatedly. This manual repetition not only increases effort but also consumes valuable time. Therefore, to streamline this process and save both manual effort and time, we present a precise method to automate Chrome extensions and configure them seamlessly for efficient use.

How to Automate Chrome Extension:

In this article, we will learn the process of Automate Chrome extensions and performing click actions using the Selenium WebDriver and about the Robot Class in Selenium. We will examine them in the Chrome browser using Java. Here we go !!

Before moving on to the main topic of our discussion, let’s quickly review the techniques we will use to Automate Chrome extension and conduct action.

Prerequisite:

Technologies	Download Link
IntelliJ idea IDE	https://www.jetbrains.com/idea/download/#section=windows
Maven	https://maven.apache.org/download.cgi
Java JDK – 11	https://www.oracle.com/in/java/technologies/javase/jdk11-archive-downloads.html
Cucumber-java – 7.11.0	https://mvnrepository.com/artifact/io.cucumber/cucumber-java/7.11.0
Cucumber-core – 7.11.0	https://mvnrepository.com/artifact/io.cucumber/cucumber-core/7.11.0
Selenium-java – 4.8.0	https://mvnrepository.com/artifact/org.seleniumhq.selenium/selenium-java/4.8.0
Webdrivermanager – 5.3.0	https://mvnrepository.com/artifact/io.github.bonigarcia/webdrivermanager/5.3.1
Mofiki’s Coordinate Finder	https://www.softpedia.com/get/Desktop-Enhancements/Other-Desktop-Enhancements/Mofiki-s-Coordinate-Finder.shtm
Apache Maven – 3.9.0	https://maven.apache.org/download.cgi

Implementation Steps to Automate Chrome Extension:

Add Calculator extension to the local Chrome browser.
Pack the extension and create a .crx file in File Explorer.
Create a Maven project using IntelliJ IDE.
Add dependencies in POM.xml file and Add .crx file in resources package.
Create Packages and files in the project.
- 5.1. Creating Features File.
- 5.2. Creating Steps file.
- 5.3. Creating Page Object Design Pattern.
- 5.4. Creating TestContext File.
- 5.5. Creating BaseStep File.
Conclusion

I intend to use simplified language while articulating the concepts. So, let’s dive into the core of our topic, which is how to Automate Chrome extensions and perform click actions using Selenium.

To do this, we will follow a few rules, which I have depicted below as steps.

Step 1: Add Calculator extension to the local Chrome browser.

In this article, we are going to use the Calculator extension to Automate Chrome extension and perform an action on an extension out of the DOM element.

To add calculator extension to local Chrome browser –

Go to Chrome browser and navigate to the extension page or visit the link below:
https://chrome.google.com/webstore/category/extensions?hl=en
On the Extension page find the search box and search for the calculator extension.
Select calculator first and then download and click on add extension

After adding an extension, visit chrome://extensions/ URL from the address bar and then enable the Developer mode.

Also on this site, we can see our calculator extension which we just added.

On an extension, there could be an Extension ID. We have to note down this extension ID. In the next step, we will learn about generating a folder named extension ID in File Explorer.

In this article Extension ID is hcpbdjanfepobbkbnhmalalmfdmikmbe

Congratulations, we have completed our first step of Adding the Calculator extension to the local automate Chrome browser.

Now let’s begin with the next step.

Step 2: Pack the extension and create a .crx file in File Explorer

Before continuing with the second step we will learn what a .crx file extension is.

What is a .crx file extension?

A Chrome extension file has a .crx extension. It increases the functionality of the Google Chrome web browser by allowing third-party applications to supplement its basic functionality.

Now, we will learn how to pack the calculator extension and generate a .crx file extension.

After adding the calculator extension to the local Chrome browser, the file explorer will generate a folder with the name extension ID (hcpbdjanfepobbkbnhmalalmfdmikmbe).

Follow the provided path to locate the extension folder –

(to locate AppData we have to enable show hidden folders)
C:➜Users➜{UserName}➜AppData➜Local➜Google➜Chrome➜User➜Data➜Default➜Extensions➜hcpbdjanfepobbkbnhmalalmfdmikmbe➜1.8.2_0

In the extension folder, we will find the folder named Extension ID, which we have noted down here hcpbdjanfepobbkbnhmalalmfdmikmbe is the Extension ID for calculator extension. Open that folder.

In the folder, we can see a version folder of the extension. Open that folder ➜1.8.2_0

Now we have to copy the path as mentioned in below image –

We will use this path to pack the extension in next steps.

Now, launch the Chrome browser and Visit chrome://extensions/ in the address bar

Here we can see the pack extension option.

➜ Click on Pack Extension to automate chrome extension

After visiting the page we will be able to see the Pack Extension option as shown in the below image.

Here we have to type or paste the path that we had copied.

➜Add copied path to the Extension root directory

In this step, we have to paste a copied path to the Extension root directory to pack our Extension and then we have to click on the Pack Extension Button.

➜Copy the path of the .crx file

After clicking on the Pack Extension button a pop-up frame will appear. Here, we can see the path of the .crx file where it has been generated in File Explorer. Remember the path of the .crx file and click on the OK button.

➜ Navigate to the .crx file in file explorer

Now let’s navigate to the path of the .crx file as mentioned in the step above . Once we navigate to the path of the .crx file we can see the file has been generated. We have to use this .crx file in our maven project to display it in the selenium web driver and perform actions on it.

Congratulations!! We have successfully generated a .crx file.

Step 3: Create a Maven project using Intellij IDE.

Before creating a Maven project. Let’s understand what Maven is.

What is Maven?

Maven is a Java project management tool that the Apache Software Foundation developed. It is written in Java Language to build projects written in C#, Ruby, Scala, and other languages. It allows developers to create projects, dependencies and documentation using Project Object Model and plugins.

Why do we use Maven?

Maven is the latest build testing tool and a project management tool.
It makes the build process very easy (No need to write long scripts).
It has a standard directory structure which is followed.
It follows Convention over Configuration.
It has a remote maven repository with all the dependencies in one place that can be easily downloaded.
Can be used with other programming languages too, just not Java.

Hope, this now gives a clear view of Maven. Now let’s create a new Maven project using Intellij Idea IDE.

Open your IntelliJ IDE and go to the File ➜ New ➜ Project as shown in the below image.

A new project pop-up will be displayed on the screen, and we must enter the project’s details here.

Details required to create the Maven project are:

Name: Provide a suitable name as per your requirement.
Location: Choose the location where you want to store your project.
Language: Choose the programming language as per your requirement.
Build System: Here you have to choose Maven.
JDK: Choose the JDK you want to use. (Note: Maven uses a set of identifiers, also called coordinates, to uniquely identify a project and specify how the project artifact should be packaged.)
GroupId: a unique base name of the company or group that created the project
ArtifactId: a unique name for the project.

Simply, click on the Create button and the Maven project will be created.

After successfully creating the project we can see the structure of the Maven project. Some default files have been created as given in the image below.

Yes !! We have successfully created our Maven project. Let’s move ahead.

Step 4: Add dependencies in POM.xml file and Add .crx file in the resources package.

We shall include Maven dependencies in your project using IntelliJ IDEA. These dependencies need to be mentioned in our pom.xml file for our project build-up.

Below are the dependencies that we need to add to the pom.xml file.

<dependencies>
   <dependency>
       <groupId>io.cucumber</groupId>
       <artifactId>cucumber-java</artifactId>
       <version>7.11.0</version>
   </dependency>
   <dependency>
       <groupId>io.cucumber</groupId>
       <artifactId>cucumber-core</artifactId>
       <version>7.11.0</version>
       <scope>compile</scope>
   </dependency>
   <dependency>
       <groupId>org.seleniumhq.selenium</groupId>
       <artifactId>selenium-java</artifactId>
       <version>4.8.0</version>
   </dependency>
   <dependency>
       <groupId>io.github.bonigarcia</groupId>
       <artifactId>webdrivermanager</artifactId>
       <version>5.3.0</version>
   </dependency>
</dependencies>

selenium-java: Selenium WebDriver library for Java language binding
cucumber-java: Cucumber JVM library for Java language binding.
webdrivermanager: library to automatically manage and set up all the drivers of all browsers which are in test scope.

After adding dependencies in the pom.xml file we have to add the .crx file to the resources directory, .crx file is the file that we have generated in step 2.

To add the .crx file to the resources directory, copy the file from the file explorer and paste it into the resources directory. We can also rename the .crx file as we want.

For renaming the file, right-click on the file ➜ select the refactor option ➜ then click on the rename option.

As shown in the above image, the rename pop-up will flash on the screen. Here we can give the file name as desired.

Here in this project, I am renaming the .crx file with the CalculatorExtension.crx file.

Step 5: Create Packages and files in the project to automate chrome extension.

After adding dependencies to the pom.xml file. We have to create a BDD framework that includes packages and files. Before moving ahead, let’s first get an overview of the Cucumber BDD framework.

What is the Cucumber Behavior Driven Development (BDD)Framework?

Cucumber is a Behavior Driven Development (BDD) framework tool for writing test cases. It is a testing tool that supports Behavior Driven Development (BDD). It offers a way to write tests that anybody can understand, regardless of their technical knowledge. In BDD, users (business analysts and product owners) first write scenarios or acceptance tests that describe the behavior of the system from the customer’s perspective. These scenarios and acceptance tests are then reviewed and approved by the product owners. The Cucumber framework uses Ruby as programming language.

To manage our code files for the project we need to create packages that are as follows:

Features Package – All feature files are contained in this package.
Steps Package – All step definition files are included in this package.
Pages Package – All page files are included in this package.
Utilities Package – All configuration files are included in this package.

Now, we have to create a feature file,

5.1: Creating Features File:

Features file contains a high-level description of the Test Scenario in simple language. It is known as Gherkin. Gherkin is a plain English text language

Cucumber Feature File consists of following components –

Feature: We use “Feature” to describe the current test script that needs execution.
Scenario: We use Scenario to describe the steps and expected outcome for a particular test case.
Given: We use “Given” to specify the context of the text to be executed. We can parameterize steps by using data tables “Given”
When: “When” indicates the test action that we have to perform.
Then: We represent the expected outcome of the test with “Then”

We need to add the below code in the feature file for our project.

Feature: Calculator Extension
 Scenario: User want to add 2 number by using calculator extension
   Given User clicks on the extension icon
   And User clicks on the calculator extension
   When User clicks on number 9
   And User clicks on "+" operator
   And User clicks on number 2
   And User clicks on "=" operator
   Then User sees the result 11

According to the above feature file, we are adding two numbers. To open the Chrome WebDriver and add a calculator extension, we use a GIVEN file. With the use of ‘WHEN’ and ‘AND’ annotations, we are executing click actions on the calculator extension, with which we are adding two numbers from the calculator. In the final step, we are using the ‘THEN’ annotation to verify the result (the addition of two numbers).

5.2: Creating Steps file.

Steps Definition to automate chrome extension-

Step definition maps the Test Case Steps in the feature files (introduced by Given/When/Then) to code. It executes the steps on Application Under Test and checks the outcomes against expected results. For a step definition to execute, it requires matching the “Given” component in a Feature.

Here in the step file, we are mapping the steps from the feature file. In simple words, we are making a connection between the steps of the feature file and with step file. While mapping the steps we have to take care about the format of mapping the steps in step definition. We need to use the below format to map the steps for the feature we had created in the features file.

package Steps;
import Pages.CalculatorPage;
import Utilities.TestContext;
import io.cucumber.java.en.And;
import io.cucumber.java.en.Given;
import io.cucumber.java.en.Then;
import io.cucumber.java.en.When;

import java.awt.*;

public class CalculatorStep extends TestContext {

   CalculatorPage calculatorPage;

   public CalculatorStep() throws AWTException { calculatorPage = new CalculatorPage(driver); }

   @Given("User clicks on the extension icon")
   public void User_clicks_on_the_extension_icon() throws InterruptedException {
       calculatorPage.User_clicks_on_the_extension_icon();
   }
   @And("User clicks on the calculator extension")
   public void userClicksOnTheCalculatorExtension() throws InterruptedException {
       calculatorPage.userClicksOnTheCalculatorExtension();
   }
   @When("User clicks on number {int}")
   public void User_clicks_on_number(int number) throws InterruptedException {
       calculatorPage.User_clicks_on_number(number);
   }

   @And("User clicks on {string} operator")
   public void userClicksOnOperator(String operator) throws InterruptedException {
       calculatorPage.userClicksOnOperator(operator);
   }

   @Then("User sees the result {int}")
   public void userSeesTheResult(int result) throws InterruptedException {
       calculatorPage.userSeesTheResult(result);
   }
}

5.3: Creating Page Object Design Pattern

Till now we have successfully created a feature file and a step file. Now in this step, we will be creating a page file. Page file contains all the logic of the test cases. Generally, in Web automation, we have page files that contain the locators and the actions to perform on the web elements but in this framework, we are not using the locators because as we know extension is not in the DOM(Document Object Model) element as it is outside the DOM element. So we will only create the methods and for those methods, we will be using Robot class and X and Y coordinates.

Here in this code, we are performing the activities that are hovering by the mouse actions(move, press, release), clicking on the calculator extension, clicking on the two numbers from the calculator, clicking on the calculator’s “+” addition operator, and obtaining the result of the addition of those two numbers.

What is the Robot Class in Selenium?

Robot Class in Selenium is used to enable automated testing for implementations of the Java platform. It generates input events in native systems for test automation, self-running demos, and other applications where users need control over the mouse and keyboard. Selenium Webdriver was unable to handle these pop-ups or applications and extensions. So a robot class was introduced in Java versions 1.3 and above, that can handle OS pop-ups or applications and extensions.

Robots help in managing all the activities like performing the task within the specified time, handling the mouse functions and the keyboard functions, and many more

While we are using the robot class, it requires the x and y coordinates of the element of the screen on which we will be performing the actions i.e hovering the cursor and then performing click actions.To find the coordinates we are using the Mofiki’s Coordinate finder.

What is Mofiki’s Coordinate Finder?

Mofiki’s Coordinate Finder finds out the present x and y coordinates of our cursor by hovering the mouse anywhere on the screen with the help of the application Mofiki’s Coordinate Finder, which is available for free download.

Steps to download and use Mofiki’s Coordinate Finder:-

Visit the link below to download Mofiki’s Coordinate Finder application zip file.
https://www.softpedia.com/get/Desktop-Enhancements/Other-Desktop-Enhancements/Mofiki-s-Coordinate-Finder.shtml
Download the zip file of Mofiki’s Coordinate Finder application
Extract the Mofiki’s Coordinate Finder zip file and install the application setup.
Now open the Mofiki’s Coordinate Finder application. Use the image below as an illustration.

Now to find the x and y coordinates move the cursor to the point and just press the space bar we can get the x and y coordinates

package Pages;
import Utilities.TestContext;
import org.openqa.selenium.WebDriver;
import java.awt.*;
import java.awt.event.InputEvent;
import java.util.Objects;
public class CalculatorPage extends TestContext {
   public CalculatorPage(WebDriver driver) throws AWTException {
       TestContext.driver = driver;
   }
   public Robot robot = new Robot();

   public void User_clicks_on_the_extension_icon() throws InterruptedException {
       Thread.sleep(5000);
       robot.mouseMove(1250, 48);
       Thread.sleep(2000);
       robot.mousePress(InputEvent.BUTTON1_DOWN_MASK);
       robot.mouseRelease(InputEvent.BUTTON1_DOWN_MASK);
   }

   public void userClicksOnTheCalculatorExtension() throws InterruptedException {
       Thread.sleep(3000);
       robot.mouseMove(1000, 190);
       Thread.sleep(2000);
       robot.mousePress(InputEvent.BUTTON1_DOWN_MASK);
       robot.mouseRelease(InputEvent.BUTTON1_DOWN_MASK);
       Thread.sleep(10000);
   }

   public void User_clicks_on_number(int number) throws InterruptedException {
       Thread.sleep(3000);
       if(number == 0){robot.mouseMove(964, 450);}
       else if (number == 1) {robot.mouseMove(969, 390);}
       else if (number == 2) {robot.mouseMove(1020, 391);}
       else if (number == 3) {robot.mouseMove(1087, 395);}
       else if (number == 4) {robot.mouseMove(964, 334);}
       else if (number == 5) {robot.mouseMove(1023, 336);}
       else if (number == 6) {robot.mouseMove(1080, 334);}
       else if (number == 7) {robot.mouseMove(965, 282);}
       else if (number == 8) {robot.mouseMove(1023, 278);}
       else if (number == 9) {robot.mouseMove(1089, 278);}
       else { System.out.println("Number Invalid"); }
       Thread.sleep(2000);
       robot.mousePress(InputEvent.BUTTON1_DOWN_MASK);
       robot.mouseRelease(InputEvent.BUTTON1_DOWN_MASK);
       Thread.sleep(10000);
   }

   public void userClicksOnOperator(String operator) throws InterruptedException {
       Thread.sleep(3000);
       if(Objects.equals(operator, "+")){robot.mouseMove(1139, 446);}
       else if (Objects.equals(operator, "-")) {robot.mouseMove(1137, 386);}
       else if (Objects.equals(operator, "*")) {robot.mouseMove(1137, 331);}
       else if (Objects.equals(operator, "/")) {robot.mouseMove(1138, 272);}
       else if (Objects.equals(operator, "=")) {robot.mouseMove(1199, 446);}
       else {System.out.println("Invalid Operator");}
       Thread.sleep(2000);
       robot.mousePress(InputEvent.BUTTON1_DOWN_MASK);
       robot.mouseRelease(InputEvent.BUTTON1_DOWN_MASK);
       Thread.sleep(10000);
   }

   public void userSeesTheResult(int result) {
       System.out.println("Result should be "+ result);
   }
}

5.4: Creating TestContext File.

Now, In the Utilities package we have to create a TestContext file in which we can declare a webdriver. Declaring the webdriver as public allows initialization in every class file after inheriting the TestContext class. The step file and page file inherit the testContext class file. Also, we have declared Robot class here.

package Utilities;

import org.openqa.selenium.WebDriver;

import java.awt.*;

public class TestContext {
   public static WebDriver driver;

   public static Robot robot;
}

5.5: Creating BaseStep File:

This step is very important because we will be creating an environment file (i.e. Hooks file) and also we are using Chrome Options to add Calculator extensions.

Before moving ahead let’s understand about Before and After Hook and Chrome Options

What is Before and After Hooks?

Hooks allow us to better manage the code workflow and help us reduce code redundancy. We can say that it is an unseen step, which allows us to perform our scenarios or tests.

@Before – Before hooks run before the first step of each scenario.

@After – Conversely After Hooks run after the last step of each scenario even when steps fail, are undefined, pending, or skipped.

What are Chrome Options?

For managing different Chrome driver properties, Selenium WebDriver has a concept called the Chromeoptions Class. For modifying Chrome driver sessions, the Chrome options class is typically combined with Desired Capabilities. Eventually it enables you to carry out numerous tasks, such as launching Chrome in maximized mode, turning off installed extensions, turning off pop-ups, etc.

At this instant we have to create Before and After Hooks. At the same time each hook should contain a void method as shown in the below code.

In the Before Hook, we have to initialize the webdriver. Also, we have to add simple lines of code to add extensions to the webdriver. To add the extensions we are using Chrome Options to Automate Chrome Extension. Then in the After Hook, we are closing the webdriver.

Now, we have to create a Base Step which should have driver configuration and hooks

package Utilities;

import io.cucumber.java.After;
import io.cucumber.java.Before;
import io.github.bonigarcia.wdm.WebDriverManager;
import org.openqa.selenium.chrome.ChromeOptions;

import java.io.File;

public class BaseStep extends TestContext{
   @Before
   public void initCucWeb()
   {
       ChromeOptions options = new ChromeOptions();
       options.addExtensions(new File("src/test/Resources/CalculatorExtension.crx"));
       options.addArguments("--remote-allow-origins=*");
       options.setCapability(ChromeOptions.CAPABILITY,options);
       driver = WebDriverManager.chromedriver().capabilities(options).create();
       driver.manage().window().maximize();
   }
   @After
   public void closeWeb()
   {
       driver.quit();
   }
}

Please find attached the GitHub repository link. I have uploaded the same project to this repository. I have also attached a Readme.md file that explains the framework and the different commands we have used so far in this project.

https://github.com/spurqlabs/ChromeExtensionAutomation.git

Conclusion:

It is a very difficult task to add an extension to a web driver and perform an action on extension icons. So basically, in this article, we have found a solution to add an Automate Chrome Extension to Webdriver and to perform a Click action on the extension icon apart from learning to Automate Chrome extension using the Selenium Webdriver.

The software testing landscape has evolved towards automation to meet the demands for quick and frequent deployments, adapting efficiently to constant updates and tight deadlines in a dynamic development environment.

Sumit Kunjir

I am an SDET Engineer proficient in manual, automation, API, Performance, and Security Testing. My expertise extends to technologies such as Selenium, Cypress, Cucumber, JMeter, OWASP ZAP, Postman, Maven, SQL, GitHub, Java, JavaScript, HTML, and CSS. Additionally, I possess hands-on experience in CI/CD, utilizing GitHub for continuous integration and delivery. My passion for technology drives me to constantly explore and adapt to new advancements in the field.

How to trigger a workflow from another workflow using GitHub Action

by Yogesh Dhole | Dec 6, 2023 | Blog

GitHub Actions has revolutionized the way developers and testers automate their workflows. With Actions, developers can easily define and customize their CI/CD processes, enhancing productivity and code quality. One of the powerful features of GitHub Actions is the ability to trigger workflows from another workflow GitHub Actions. In this article, we will delve into the intricacies of mastering GitHub Actions and explore how to trigger workflows from other workflows.

Understanding GitActions and their Benefits:

GitHub Actions is a powerful automation framework integrated into GitHub. It allows developers and testers to define custom workflows composed of one or more jobs, each consisting of various steps. These workflows can be triggered based on events such as push and pull requests, commits, or scheduled actions. The benefits of using GitHub Actions include faster development cycles, improved collaboration, and streamlined release processes.

Defining workflow in GitHub Actions:

Before we delve into triggering workflows, let’s define what a workflow is in GitHub Actions. A workflow is a configurable automated process that runs on GitHub repositories. It consists of one or more jobs, each defining a set of steps. These steps can perform tasks such as building, testing, and deploying code.

Overview of triggering workflows from another workflow using GitHub Action:

It is important to understand workflow dependencies to trigger a workflow from another workflow. Workflow dependencies refer to the relationships between different workflows, where one workflow triggers the execution of another workflow. By leveraging workflow dependencies, developers and testers can create a seamless and interconnected automation pipeline.

In complex development scenarios, there is often a need to trigger workflows based on the completion of other workflows. This can be particularly useful when different parts of the development process depend on each other and when different teams collaborate on a project. By triggering workflows from related workflows, developers and testers can automate the execution of dependent tasks, ensuring a smoother development workflow.

The advantages of workflow interdependency are numerous. Firstly, it allows for a modular and reusable approach to workflow automation. Instead of duplicating steps across different workflows, developers, and testers can encapsulate common operations in one workflow and trigger it from others. This promotes code reusability, reduces maintenance efforts, and enhances overall development efficiency. Moreover, workflow interdependency enables better collaboration between teams working on different aspects of a project, ensuring a seamless integration between their workflows.

GitHub Action Prerequisites:

A GitHub repository having a workflow defined in it (repository_01)
Another GitHub repository (repository_02) has a workflow defined in it that triggers after repository_01 workflow completion.
GitHub personal access token

As we have all the required stuff for our goal then let’s get it done. First will understand about GitHub personal access token.

GitHub Personal Access Token (PAT):

Personal access tokens are an alternative to using passwords to authenticate GitHub when using the GitHub API or the command line. Personal access tokens are intended to access GitHub resources on your behalf.

To learn more about GitHub personal access tokens visit the official website of GitHub https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens

How to Generate PAT?

1: First, Access your GitHub account by logging in.

2: Navigate to your profile, click on “Settings,” and proceed to “Developers.”

3: Click on Personal Access Token and then Select Token Classic.

4: Navigate to and choose “Generate new token,” then select Generate new token Classic.

5: Here, we Include a note for your Access Token (PAT) – it’s optional. Choose the expiration date for your PAT. Select the scope and at last click on generate token. Copy the token and paste it on a notepad.

(Remember the selected scope will decide the permissions and authorization to access another repository and workflow)

So now we need to add the generated PAT to our repository_01 as a secret to do this follow the below steps.

To navigate to your repository, you can click on the settings.

Then go to secrets and variables then select the Action button.

Select the repository secret, add PAT_TOKEN in the name, and paste the copied personal access token in the value. Click on Add Secret.

Workflow Creation (repository_01):

To create a workflow head over to the action tab and click on new workflow. Then select Set up workflow yourself. Now customize your workflow and add the below step to trigger the (repository_02) workflow.

name: Workflow01
on:
  push:
    branches:
      - main
jobs:
  build:
    runs-on: ubuntu-latest
    timeout-minutes: 600

    steps:
       - name: Checkout to repository
      - uses: actions/checkout@v3

      # Your existing workflow steps
      #  At the end of your all steps add the below step
          
  trigger-workflow02:
    needs: build
    runs-on: ubuntu-latest
    steps:
      - name: Trigger Workflow02
        uses: peter-evans/repository-dispatch@v2
        with:
          token: ${{ secrets.PAT_TOKEN }}
          repository: username/repository_02 name
          event-type: trigger-workflow02

Let’s understand the trigger-workflow02 stage. Following is the secret we have added is used here to provide the permissions and the authorization to understand and trigger the workflow_02 of repository_02 also replace the username with your GitHub username and repository_02 name with your other repository name.

Workflow Creation (repository_02):

As our first workflow is ready now let’s create our second workflow for repository_02. Follow the same steps described in the above step for the creation of a workflow.

name: Workflow02
on:
  repository_dispatch:
    types:
      - trigger-workflow02
jobs:
  build-and-test:
    runs-on: ubuntu-latest

    steps:
    - uses: actions/checkout@v2
   # your existing steps of workflow

Now let’s understand what to consider here, first the triggering event is set as repository_dispatch means when the other repository is completed this workflow will get triggered and now to specify which repository we arousing types as trigger-workflow02 which is defined as a stage in the workflow01.

We are done this is how we can trigger the workflow02 of repository_02 when the execution of workflow01 of repository_01 is completed and the status is passed. Below are the output screenshots give it a check.

For Organization Account:

Till this point whatever we have seen it’s for our personal GitHub account and if we want to implement this concept for the organization’s GitHub account then we need to introduce a small change in the workflow01 of the repository_01.

name: Workflow01
on:
  push:
    branches:
      - main
jobs:
  build:
    runs-on: ubuntu-latest
    timeout-minutes: 600

    steps:
       - name: Checkout to repository
      - uses: actions/checkout@v3

      # Your existing workflow steps
      #  At the end of your all steps add the below step
          
  trigger-workflow02:
    needs: build
    runs-on: ubuntu-latest
    steps:
      - name: Trigger Workflow02
        uses: peter-evans/repository-dispatch@v2
        with:
          token: ${{ secrets.PAT_TOKEN }}
          repository: organization/repository_02 name
          event-type: trigger-workflow02

Let’s understand the trigger-workflow02 stage. The secret we have added is used here to provide the permissions and authorization to trigger the workflow_02 of repository_02 also replace the organization with your organization’s GitHub name and repository_02 name with your other repository name.

Conclusion:

In this blog, we have explored the powerful feature of trigger workflow from another workflow using GitHub Actions. By understanding workflow dependencies, leveraging workflow events and triggers, implementing remote triggers, and building scalable workflow chains, developers can enhance their CI/CD processes and workflow automation. To summarize, triggering workflows from another workflow allows for increased reusability, collaboration, and customization of automation processes. By embracing these features, developers can optimize their development workflows and empower their teams to achieve greater productivity and efficiency.

Test Automation using Cucumber, JavaScript, and Webdriver.IO

by Harishchandra Ekal | Oct 11, 2023 | Blog

Introduction:

In this blog, we have created the WebdriverIO framework, which will help to run test cases on web applications on different browsers. WebdriverIO is a popular open-source test automation framework for Node.js.Creating a test automation framework using Cucumber, JavaScript, and WebdriverIO offers several benefits that can streamline your testing process and improve the efficiency and maintainability of your automated tests. Here’s why you might want to consider using this combination:

1. BDD Approach with Cucumber:

Cucumber enables Behavior-Driven Development (BDD), allowing you to write test scenarios in a human-readable format.

2. JavaScript Language:

JavaScript is a widely used programming language for web development, making it accessible to many developers.

3. WebdriverIO as Test Automation Framework:

WebdriverIO is a popular JavaScript-based WebDriver framework that simplifies interactions with browsers and elements on web pages. It also provides a variety of built-in commands for browser automation, making test script development more efficient.WebdriverIO supports multiple testing frameworks, including Mocha and Jasmine, which can be integrated with Cucumber for BDD.

4. Cross-Browser Testing:

With WebdriverIO, you can quickly run your tests across different browsers and browser versions. This ensures that your application functions correctly and consistently across various browser environments.

5. Reusability and Maintainability:

The combination of Cucumber and WebdriverIO promotes the creation of reusable step definitions. Moreover, this modularity makes it easier to maintain test scripts and reduces duplication of code.

6. Parallel Execution:

WebdriverIO supports parallel test execution, which can significantly reduce the overall test execution time.

7. Community and Support:

Both Cucumber and WebdriverIO have active communities, which means you can find a wealth of resources, tutorials, and plugins to enhance your automation efforts.

Let’s see how webdriverIO works and Its process:

Pre-requisites:

1. Make sure you have Node.js installed on your system. You can download and install it from the official website: https://nodejs.org/en/

2. Open your terminal or command prompt and create a new directory for your WebdriverIO project or else create a folder wherever you want & open it in VSCode

3. VSCode

Initialize a new npm project by running the following command: “npm init wdio .” This will create WebdriverIO packages and their installation.

Once you execute that command you will get the following message:

“Need to install the following packages:

create-wdio@8.2.3

Ok to proceed? (y)”

If you proceed by pressing “y”, you will receive a list of instructions on how to generate the framework. You can follow these instructions to create the desired WebdriverIO framework.

Once you have completed the framework generation process, it will create a package.json file that will serve as a record of your project’s dependencies. This file will help you manage and keep track of the dependencies required for your project.

{
  "dependencies": {
    "@wdio/selenium-standalone-service": "^8.3.2"
  },
  "devDependencies": {
    "@wdio/cli": "^8.3.10",
    "@wdio/cucumber-framework": "^8.3.0",
    "@wdio/local-runner": "^8.3.10",
    "@wdio/spec-reporter": "^8.3.0",
    "chromedriver": "^110.0.0",
    "wdio-chromedriver-service": "^8.1.1"
  },
  "scripts": {
    "wdio": "wdio run ./wdio.conf.js"
  }
}

“Install WebdriverIO and its CLI tool by running the following command:

“npm install webdriverio @wdio/cli –save-dev”.

This will install WebdriverIO and its CLI tool as dev dependencies and save them in your package.json file.”

Package-json:

package.json is a file used in Node.js projects that contains metadata and configuration information for the project, as well as a list of dependencies and dependencies required for the project to run. It is located in the root directory of the project and is used by package managers such as npm (Node Package Manager) to install and manage dependencies.

Wdio.conf.js:

This file contains the configuration settings that define how the test automation framework runs and interacts with the web application being tested. It has Capabilities, Specs, Framework, Reporter, Hooks, Services, etc.

exports.config = {
    runner: 'local',
    specs: [
      './features/**/*.feature'
    ],
    // Patterns to exclude.
    exclude: [
        // 'path/to/excluded/files'
    ],
    maxInstances: 10,
    capabilities: [{
        maxInstances: 5,
        browserName: 'chrome',
        acceptInsecureCerts: true
    }],
acceptInsecureCerts: true   
  }],
    logLevel: 'info',
    bail: 0,
    baseUrl: 'https://www.calculator.net/',
    seleniumLoginUrl: 'https://demo.guru99.com/test/login.html',
    waitforTimeout: 10000,
    connectionRetryTimeout: 120000,
    connectionRetryCount: 3,
    services: ['chromedriver'],
    framework: 'cucumber',
    reporters: ['spec'],
    seleniumAddress: 'http://localhost:4444/wd/hub',
    cucumberOpts: {
        require: ['./features/step-definitions/*.js'],
        backtrace: false,
        requireModule: [],
        dryRun: false,
        failFast: false,
        snippets: true,
        source: true,
        strict: false,
        tagExpression: '',
        timeout: 60000,
        ignoreUndefinedDefinitions: false
    },
}

So here we have selected the “cucumber” framework which will help create test cases in BDD format. Before we go into framework details, you all should know that all WebdriverIO commands are asynchronous and need to be properly handled using async/await.

The Page Object Model (POM) is a popular design pattern used in software testing to represent web pages as objects and simplify the process of automated testing. The POM structure usually includes a “pageobjects” folder, which contains classes or files that represent individual pages on a website or application. These page object classes or files encapsulate the elements and actions related to a specific page, making writing and maintaining automated tests easier. By using the POM, testers can create a more organized and maintainable framework for their test automation efforts.

1) Features

2) Steps

3) Pages

Features:

This folder contains another two folders, i.e., pageobjects, Step-definitions, and features files. The “feature” folder is typically used in the context of behavior-driven development (BDD) frameworks such as Cucumber, which uses a natural language syntax to describe test scenarios. The “feature” folder houses files that define the scenarios or features to be tested.

To create a feature file in VSCode for implementing behavior-driven development (BDD) scenarios using Cucumber, you can follow these steps:

· Open VSCode and navigate to the folder where you want to create the feature file.

· Right-click on the folder, go to “New File”, and click on it to create a new file.

· Give the file a name with the “.feature” extension, for example, “login.feature”

Feature: Checking calculator functionality 

Scenario: Verify addition on calculator
Given User is on the calculator page
When User taps on "4"
And User taps on operator
And User taps on "5"
Then User verifies the answer as "9"

Scenario Outline: Verify user can perform multiple operation
Given User is on the calculator page
When User clicks on num1 "<number1>"
The user clicks on the "<operator>"
And User clicks on num2 "<number2>"
Then User verifies "<answer>"
Examples:
    | number1 |number2 | operator | answer |
    | 4       | 5      | +        | 9      |
    | 5       | 3      | -        | 2      |
    | 4       | 5      | *        | 20     |
    | 6       | 2      | /        | 3      |

In the above feature file, I have shown one simple scenario where I have performed a simple addition operation, and in the next scenario, I have created a scenario outline where different operations are performed, including addition, subtraction, multiplication, and division.

Step-definitions:

The “step-definitions” folder contains files or classes that define the behavior or actions associated with each step in the BDD scenario.

In WebDriverIO, you can generate step definitions for the given scenarios in a feature file using a tool called “cucumber-boilerplate”.

Following are the steps to generate steps in WebDriverIO using cucumber:

Install the “cucumber-boilerplate” package as a development dependency by running the following command in your project directory: “npm install cucumber-boilerplate –save-dev”

Once the installation is complete, you can generate the step definitions by running the following command: “npx cucumber-boilerplate generate”

This will prompt you to enter the path to the feature file for which you want to generate the steps.

Provide the path to the feature file (e.g., “./features/login.feature”) and press Enter.

The tool will generate the step definitions in JavaScript format, which you can then copy and paste into your WebDriverIO project’s step definition files.

const { Given, When, Then } = require('@wdio/cucumber-framework');
const addPage = require('../pageobjects/AddPage');
Given(/^User is on calculator page$/, async () => {
    await addPage.visitWeb()
  });

When(/^User taps on "(\d+)"$/, async (num) => {
    await addPage.tapNumber(num)
})

When(/^User taps on operator$/, async () => {
    await addPage.tapOperator()
}
Then(/^User verifies answer as "(\d+)"$/, async (ans) => {
    await addPage.getAns(ans)
})

When(/^User clicks on num1 "([^"]*)"$/, async (num1) => {
    await addPage.clickNum1(num1)
})

When(/^User clicks on num2 "([^"]*)"$/, async (num2) => {
    await addPage.clickNum2(num2)
})

When(/^User clicks on the "([^"]*)"$/, async(opt) =>{
   await addPage.clickOperator(opt)
})

Then(/^User verifies "([^"]*)"$/, async(ans) =>{
   await addPage.verifyAnswer(ans);
})

In the above code, you can see we have integrated steps for each line of the feature file, so we can run code in BDD format.

Page objects:

These are classes that represent a web page, containing methods and properties that interact with the page’s elements, such as buttons, links, and input fields.

const { config } = require("../../wdio.conf");
const assert = require('assert');
const addPageLoc = require("../../Locators/AddPageLocators")
const scr = require('../pageobjects/ScreenshotPage')
class AddPage{
    constructor(){
        this.plusOpt = addPageLoc.plusOpt;
        this.answer = addPageLoc.answer;
    }
    // Since we parameterized the value for the locator, we kept it as is.
    getNumber(num){
        return $('[onclick="r('+num+')"]')
    }
    async tapNumber(num){
        await this.getNumber(num).click();
        scr.takeScreenshot('tapping_number');
    }
    async tapOperator(){
        await this.plusOpt.click()
        await browser.pause(3000);
        scr.takeScreenshot('tapping_operator');
    }
    async getAns(){
        let txt = await this.answer.getText()
        console.log("Answer of addition: " +txt);
        scr.takeScreenshot('gettingTextOfElement');
    }

    async visitWeb(){
        await browser.url(config.baseUrl)
        scr.takeScreenshot('webUrl');
    }

    async clickNum1(num1){
        await this.getNumber(num1).click();
        scr.takeScreenshot('clicking_number1');
    }
    async clickNum2(num2){
        await this.getNumber(num2).click();
        scr.takeScreenshot('clicking_number2');
    }
    async clickOperator(opt){
        await $('[onclick="r(\''+opt+'\')"]').click();
        // await this.operator.replace('XXX', opt).click();
        scr.takeScreenshot('clicking_operator');
    }
    async verifyAnswer(ans){
        let result = await this.answer.getText()
        console.log("Retrieving text value from element: " +result)
        assert.equal(result,parseInt(ans));
        scr.takeScreenshot('verifyingResult');
    }
}
module.exports = new AddPage();

The browser.pause() method was used to pause it for the specified amount of time. It takes time in milliseconds.

Also, we added methods to the “AddPage” class, such as click() and setValue(), that are necessary to perform operations on web elements. Also, the setValue() method has been used for sending values for web elements.

Locators:
This folder includes all the locators required to operate web elements

module.exports = {
    plusOpt: $('[onclick="r(\'+\')"]'),
    answer: $('[id="sciOutPut"]'),
    operator: $('[onclick="r(\'XXX\')"]'
}

In the above code, we listed out all the locators in one file and then imported them into pages, removing clumsiness from the code

Now that we have completed implementing the Page Object Model (POM) design pattern, we can consider incorporating additional functionalities to further enhance the framework’s suitability and reliability.

Screenshots:

To add screenshot functionality to your code, you need to incorporate the following code into your implementation:

class ScreenshotPage{
    takeScreenshot(filename) {
        const timestamp = new Date().getTime();
        const filepath = `./screenshots/${filename}_${timestamp}.png`;
        browser.saveScreenshot(filepath);
      }
}

module.exports = new ScreenshotPage();

Import this code into the page where you need to capture a screenshot by calling takeScreenshot(‘nameOfScreenshot’).

The above image displays the screenshots it took. The sequence of screenshots offers an overview of the test case, illustrating actions taken at each step.

Cross-Browser Testing:

Cross-browser testing is a practice in software testing that involves testing a web application or website across multiple web browsers and browser versions to ensure its consistent functionality and appearance across different browser environments.

Capabilities:

In the wdio.conf.js file, make changes similar to what I have done in the ‘capabilities’ section. I have attached the following code for your reference. You can use it for assistance and make changes accordingly.

capabilities: [
    {
        browserName: 'chrome',
        acceptInsecureCerts: true
      },
      {
        browserName: 'firefox',
        acceptInsecureCerts: true
     }
],

Services:

In the ‘services’ section of the wdio.conf.js file, make changes similar to what I did in the following code snippets. You can make changes accordingly and run your test cases smoothly.

services: ['selenium-standalone'],
    seleniumInstallArgs: {
        drivers: {
          chrome: { version: 'latest' },
          firefox: { version: 'latest' },
          chromiumedge: { version: 'latest' },
        },
      }
      seleniumArgs: {
        drivers: 
          chrome: { version: 'latest' },
          firefox: { version: 'latest' },
          chromiumedge: { version: 'latest' },
        },
      },

The above code will assist you in implementing different browsers for testing, and you can also add others like Microsoft Edge, Safari, etc.

Allure_Report:

Allure Reports are often preferred over Cucumber Reports due to their visually appealing visualizations, comprehensive insights, step-by-step details, time tracking, integration capabilities, and historical trend analysis.

Once you have completed the automation process, testers need to generate reports to track the status of test cases, including pass or fail results and the exact location of failures. You can use the Allure report functionality for this purpose in your WebDriverIO project Follow these steps to include the Allure report:

1. Install the Allure Reporter plugin for WebDriverIO using the following command: “npm install @wdio/allure-reporter –save-dev”

2. Add the Allure Reporter plugin to your wdio.conf.js file as a reporter. Following is an example configuration:

exports.config = {
reporters: ['spec', ['allure', {
outputDir: './allure-results',
disableWebdriverStepsReporting: true,
disableWebdriverScreenshotsReporting: false,
}]],
}

In this example, we’re using the spec reporter for console output and the allure reporter for generating the allure report. The outputDir option specifies the directory where it will generate the report files.

1. Add the Allure command line tool to your project by running the following command: “npm install allure-commandline –save-dev”

2. After running your tests, generate the Allure report by running the following command: “npx allure generate allure-results –clean”

Project Folder Structure:

As we have completed the design of the folder structure for the framework, you can now see below the image of what the framework’s folder structure looks like.

The image above shows the integration of different folders in the WebdriverIO framework. I have provided explanations for each folder and its contents.

To run test cases on a browser, you can use the following commands:

· npx wdio wdio.conf.js

· npx wdio run wdio.conf.js –spec features\Addition.feature // To run a specific feature file

· npx wdio wdio.conf.js –spec ./path/to/your/test.js –browser chrome // To run on a specific browser”

Note: Please make sure to replace the path and file names with the appropriate ones for your specific setup

Conclusion:

WebdriverIO is a comprehensive and feature-rich framework that empowers developers and testers to create reliable and efficient automation tests for web applications. It is a vast ecosystem of plugins, extensive documentation, and also active community support make it a top choice for automation testing in the modern web development landscape. By adopting WebdriverIO, organizations can significantly improve their web application testing efforts and deliver high-quality software to their end-users.