Automated end to end testing is the most effective way to test your app. This tutorial explains how to use Selenium and Docker for End-to-End Testing. The Ultimate Guide to End to End tests with Selenium and Docker. Automated End-To-End Testing with Selenium and Docker
Automated end to end testing is the most effective way to test your app. It also requires the least effort to get the benefit of tests if you currently have no tests at all. And you don’t need a ton of infrastructure or cloud servers to get there. In this guide we’ll see how we can easily overcome the two main hurdles of end to end testing.
The first hurdle is Selenium. The API you use to write tests is ugly and non-intuitive. But it is not that difficult to use, and with a few convenient functions it can become a breeze to write Selenium tests. The effort is well rewarded because you can automatically test your end users' flows end to end.
The second hurdle is putting components together in an isolated environment. We want the frontend, the backend, the database and everything else your app uses. We will use Docker compose to put things together and automate the tests. It is easy even if you have your components in different Git repositories.
Even if you are an API only business, you have a frontend, and an admin or back office frontend. So end to end tests ultimately talk to a frontend application. The industry standard tool is Selenium. Selenium provides an API to talk to the web browser and interact with the DOM. You can check what elements are displayed, fill inputs and click around. Anything a real user would do with your application, you can automate.
Selenium uses something called the WebDriver API. It is not very handy to use at first glance. But the learning curve is not steep. Creating a few convenience functions will get you productive in no time. I won’t go into the details of the WebDriver API here.
There are also libraries to make your life easier. Nightwatch is the most popular.
If you have an Angular application, Protractor is your best friend. It integrates with the Angular event loop and allows you to use selectors based on your model. That is gold.
Writing a test for your most critical user feature or your app should take only a few hours, so go ahead. It will run automatically ever after. Let's see how.
We need to run our tests in an isolated environment so the outcome is predictable. And so we can enable Continuous Integration easily. We'll use Docker compose.
Selenium provides Docker images out of the box to test with one or several browsers. The images spawn a Selenium server and a browser underneath. It can work with different browsers.
Let’s start with one browser for now: headless-chrome. The docker-compose.yml file looks as below (commands are from an Angular example).
Note: If you've never used Docker you can easily install it on your computer. Docker has the troublesome tendency of forcing you to sign up for an account just to download the thing. But you actually don't have to. Go to the release notes (link for Windows and link for Mac) and download not the latest version but the one right before. This is a direct download link.
version: '3.1' services: app-serve: build: . image: myapp command: npm run serve:production expose: - 4200 app-e2e-tests: image: myapp command: dockerize -wait tcp://app-serve:4200 -wait tcp://selenium-chrome-standalone:4444 -timeout 10s -wait-retry-interval 1s bash -c "npm run e2e" depends_on: - app-serve - selenium-chrome-standalone selenium-chrome-standalone: image: selenium/standalone-chrome expose: - 44444
The file above tells Docker to spin up an environment with 3 containers:
We need a Dockerfile to build the myapp image used for the containers. It should turn your frontend code into a bundle as close to production as possible. Running unit tests and linting is a good idea at that stage. After all no need to run end to end tests if the basics do not work.
In the Dockerfile below we use a node image as base, install dockerize and bundle the application. It is important to build for production. You don’t want to test a development build that is pre-optimizations. Many things can go wrong there.
FROM node:12.13.0 AS base ENV DOCKERIZE_VERSION v0.6.0 RUN wget https://github.com/jwilder/dockerize/releases/download/$DOCKERIZE_VERSION/dockerize-alpine-linux-amd64-$DOCKERIZE_VERSION.tar.gz \ && tar -C /usr/local/bin -xzvf dockerize-alpine-linux-amd64-$DOCKERIZE_VERSION.tar.gz \ && rm dockerize-alpine-linux-amd64-$DOCKERIZE_VERSION.tar.gz RUN mkdir -p ~/app WORKDIR ~/app COPY package.json . COPY package-lock.json . FROM base AS dependencies RUN npm install FROM dependencies AS runtime COPY . . RUN npm run lint RUN npm run test:ci RUN npm run build --production
Now that we have all the pieces together let's run the tests using this command:
docker-compose up --build --abort-on-container-exit
It is long-ish, so script it in your project somehow. What it will do is build the myapp image based on the provided Dockerfile and then start all the containers. Dockerize makes sure your app and Selenium are up before executing the tests.
The --abort-on-container-exit option will kill the environment when one container exists. Since only our testing container is meant to exit (the others are servers), that is what we want.
The docker-compose command will have the same exit code as the exiting container. It means you can easily detect from the command line if the tests succeeded or not.
You are now ready to run end to end tests locally and on any server supporting Docker. That's pretty good!
Tests run with only one browser for now, though. Let’s add more.
The standalone Selenium image spins up a Selenium server with the browser you want. To run the tests on different browsers you need to update your tests' configuration and use the selenium/hub Docker image.
The image creates a hub between your application and the standalone Selenium images. Replace the selenium container section in your docker-compose.yml as follows:
selenium-hub: image: selenium/hub container_name: selenium-hub expose: - 4444 chrome: image: selenium/node-chrome volumes: - /dev/shm:/dev/shm depends_on: - selenium-hub environment: - HUB_HOST=selenium-hub - HUB_PORT=4444 firefox: image: selenium/node-firefox volumes: - /dev/shm:/dev/shm depends_on: - selenium-hub environment: - HUB_HOST=selenium-hub - HUB_PORT=4444
We now have 3 containers: Chrome, Firefox and the Selenium hub.
All the Docker images provided by Selenium are in this repository.
Careful! There is a tricky timing effect to consider. We use dockerize to have our test container wait for the Selenium hub to be up. That is not enough because we need to wait for the standalone images to be ready – in fact to register themselves to the hub.
We can do this by waiting for the standalone images to expose a port but that is not a guarantee. It is easier to wait a few seconds before starting the tests. Update your test script to wait 5 seconds before the tests start (add a sleep command after the dockerize call).
Now you can be sure that your tests won’t start until all the browser are ready. Waiting is not ideal but a few seconds are worth it. There is nothing more annoying than failing tests because of unstable automations.
Good. We have now covered the frontend part. Let’s add the backend.
The above might seem overkill to test only the front end part of your app. But we're aiming for much more. We want to test the whole system.
Let’s add a database and a backend to our Docker compose environment.
If you are a front end developer you might think "we are the frontend team we don’t care about testing the backend". Are you sure?
The frontend is always the last part to integrate all the other pieces. That means crunch time. Crunch time that would no longer be if you were able to test the frontend with the backend continuously and catch errors sooner.
The technique I describe here is very easy to apply even if your backend is in a different repository.
This is what the docker-compose.yml looks like:
version: '3.1' services: db: image: postgres environment: POSTGRES_USER: john POSTGRES_PASSWORD: mysecretpassword expose: - 5432 backend: context: ./backend dockerfile: ./backend/Dockerfile image:mybackend command: dockerize -wait tcp://db:5432 -timeout 10s bash -c "./seed_db.sh && ./start_server.sh" environment: APP_DB_HOST: db APP_DB_USER: john APP_DB_PASSWORD: mysecretpassword expose: - 8000 depends_on: - db app-serve: build: . image: myapp command: npm run serve:sw expose: - 4200 app-e2e-tests: image: myapp command: dockerize -wait tcp://app-serve:4200 -wait tcp://backend:8000 -wait tcp://selenium-chrome-standalone:4444 -timeout 10s -wait-retry-interval 1s bash -c "npm run e2e:docker" depends_on: - app-serve - selenium-chrome-standalone selenium-chrome-standalone: image: selenium/standalone-chrome expose: - 44444
In this example we added a postgres database and a container for the backend to run. Dockerize synchronizes the containers' commands.
If your system has more than one backend component add as many containers as you need. You need to wire the container dependencies properly. This means proper hostnames as environment variables on your components. And order of startup if some components depend on others.
The Selenium tests you have written should not need any modifications. You might need to put test data in the database. To keep this step in the testing area we added the seeding script before the backend startup script. This way we are sure that things happen in the proper order:
If you look at the backend container you can see there is a catch. It uses an image called mybackend built from a file located at backend/Dockerfile. This implies that your backend is in the same git repository in a folder called backend. The name is just an example of course.
If your backend and frontend are in the same repository you are good. Define the Dockerfile to build your backend and adjust the startup command to what you need.
That’s all good but usually the backend is not in the same repository. Or you can have many backend components in different repositories. What do you do then?
The super clean solution is to have a CI process on each backend component repository.
The CI process for each component runs automated tests. Upon success it pushes a docker image with the component to a private Docker registry. The backend container in our docker-compose.yml file above would use this image.
This solution requires a private Docker registry to store your images. You can use Docker Hub but then it becomes public. If you don't have one already and don’t plan to do so, it is not worth the effort.
The other solution is to use the submodules feature in git. Your backend repository becomes a virtual child of your frontend repository. You just need to add the file .gitmodules like this to your frontend repository:
[submodule "backend"] path = backend url = [email protected]:backend/repository.git branch = develop
Run the command git submodule update --remote whichwill pull the specified branch of the backend repository into a folder called "backend". Add as many submodules as you need if you have more than one backend component.
That’s it. Have your CI run the submodule command and from a file system perspective you are as in a monorepository.
If you don’t want the backend code locally while developing the frontend just don’t run the command. You’ll have an empty backend folder.
The 2 techniques above test the frontend with the latest “CI tests passed” version of your backend. That may lead to broken builds if your components are not compatible at times.
If they are compatible more often than not, stick to the “always test with the latest versions” approach. You’ll fix the occasional incompatibilities on the fly.
That won’t work, though, if incompatibilities are business as usual. In this case you need to manually control version updates. That is very easy to do.
You can lock the version of a component in the docker-compose.yml file or in the .gitmodules file. When pushing to the Docker registry you would tag the component image with the commit number of the corresponding code. The relevant docker-compose.yml file section becomes:
backend: image: backendapp:34028fc
Similarly the .gitmodules file would not target a branch head but a given commit:
[submodule "backend"] path = backend url = [email protected]:backend/repository.git branch = 34028fc
Bonus: version updates are versioned with your code. You can track which version was used for each build. This is useful when fixing failed builds or trying to reproduce old bugs.
We could push the approach to the next level. You could have a dedicated repository that would wire all your components as git modules. Bumping the versions could be a form of delivery and handover to the test/QA team.
In theory it is best to keep the latest versions of components working together more often than not. And drop the need for manual versioning. If that is not the case that is OK. Ignore the purists who will tell you that you are not following best practices and so on.
If you are just starting don't aim for the stars at first. Pick what works best for you to enjoy the benefits of automated testing right now. Then keep improving your process along the way.
Back to Selenium and 3 important bits of advice to help you write good UI tests.
First, avoid CSS selectors if you can. Selenium works on the DOM and can identify elements by IDs or CSS or XPath. Use IDs as much as possible even if you have to add them to your app code for only this purpose. CSS and XPath selectors are shaky. As soon as your application structure changes, they will be broken.
Second, use the Page Objects approach. It is about encapsulating your application so selectors are not directly used in tests. If your page HTML/CSS changes, your tests will have to be rewritten to use new selectors. Page Objects abstract selectors and turn them into user actions. Here is a great article on how to use Page Objects properly.
Third, don’t build long user journeys in your tests. If your tests fail at the 50th action it’s going to be difficult to reproduce and fix. Create test suites that play part of the scenarios starting from the login page. This way you are always a few clicks away from the bug your tests will catch.
Also don’t risk writing tests that rely on state from previous actions. Test suite coupling is something you want to avoid.
Let's take a practical example. Say you are testing a SaaS application for schools. The use cases could be:
Along the way you will have the login process and some navigation checks.
You could write a test that goes through the whole chain as described above. And this would be convenient because to declare kids you need a class to exist. To check presence/absences you need a weekly plan in place. And you need kids to input grades. It’s a quick win to build 1 test suite that does all these things at first.
If you have nothing at the moment and want to achieve good test coverage quickly: go for it! Done is better than perfect if it allows you to catch errors in your application now.
The cleaner solution would be to use a baseline scenario to start smaller test suites. In the example above the baseline scenario should be to create a class and register kids data.
Create a test suite that does exactly that: create a class and registered kids and parents data. Always run it first. If this stops working then you don’t need to move further on. This version of the code will never reach end users anyway.
Then create a function that encapsulates the baseline scenario. It will be duplicate code to some extent with the previous test suite. But it will allow you to have a one line function to use as a setup hook for all the other test suites. This is the best of both worlds: test scenarios starting from a fresh state in the application with minimal effort.
I hope this gave you a good insight on how you can quickly set up end to end tests for a complex system. Multiple components in multiple repositories should not be a barrier. Docker compose makes it easy to put things together.
End to end tests are the best way to avoid crunch times. In complex systems late deliveries of some components put a burden on other teams. Integrations are done in a rush. Code quality drops. That's a vicious circle. Testing often and catching cross component errors early is the solution.
Selenium tests can be done quick and dirty to get going fast. That is perfectly OK. Automate things. Then improve. Remember:
Done is better than perfect any day of the year.
Thanks for reading!
What is OpenJDK? OpenJDk or Open Java Development Kit is a free, open-source framework of the Java Platform, Standard Edition (or Java SE).
H2Kinfosys is offering Selenium online training with selenium webdriver, selenium remote control, history of selenium webdriver and Junit. Learn selenium Java online from our software testing experts.
Selenium Frameworks are used to optimize the code structure and help in code maintenance. Selenium Framework using Java will help you learn how the three common Selenium frameworks can be created.
The demand for delivering quality software faster — or “Quality at Speed” — requires organizations to search for solutions in Agile, continuous integration (CI), and DevOps methodologies. Test automation is an essential part of these aspects.
Following the second video about Docker basics, in this video, I explain Docker architecture and explain the different building blocks of the docker engine; docker client, API, Docker Daemon. I also explain what a docker registry is and I finish the video with a demo explaining and illustrating how to use Docker hub.