The project practices Java Selenium 4.0.x release
ChromiumDriver
to execute the Chrome DevTools Protocol a.k.a.
cdp commands - an entirely different set of API communicated to the Chrome browser family via POST
requests to /session/$sessionId/goog/cdp/execute
with API-specific payload) feature (many of the cdp methods e.g. the DOM ones like
performSearch
,getSearchResults
getNodeForLocation
getOuterHTML
querySelectorAll
querySelector
getAttributes
overlap with classic Selenium in Classic Javascript and there are few specific ones like:
addCustomHeaders
getFrameTree
setGeolocationOverride
setDownloadBehavior
to name a few, and various event listeners
This functionality is named in official Selenium Developer Documentation as BiDirectional functionality and BiDi API
The project also exercised other new Selenium 4 API e.g. relative nearby locators whidh did not apear powerful enough yet.
For accessing the Chrome Devtools API with Selenium driver 3.x see cdp_webdriver project
This test is opening Wikipedia page and hovers over few links using "classic" Selenium Actions
class:
driver.findElement(By.id("mw-content-text")).findElements(By.tagName("a")).stream().forEach( (WebElement element ) -> {
new Actions(driver).moveToElement(element).build().perform();
}
}
To emphacise that the lambda operates "classic" object,the WebElement
type was entered explicitly.
In the @Before
-annotated method in the test class, the Fetch
API is enabled
for all requests
@Before
public void beforeTest() throws Exception {
chromeDevTools = ((HasDevTools) driver).getDevTools();
List<RequestPattern> reqPattern = new ArrayList<>();
reqPattern.add(new RequestPattern(Optional.of("*"), Optional.of(ResourceType.XHR), Optional.of(RequestStage.RESPONSE)));
chromeDevTools.send(Fetch.enable(Optional.of(reqPattern), Optional.of(false)));
(If necessary one can limit to subset of reuests via match pattern). Then in the test method callback is set up:
@Test
public void test() {
chromeDevTools.addListener(Fetch.requestPaused(),
(RequestPaused event) -> {
event.getResponseHeaders().get().stream().map((HeaderEntry entry) -> String.format("%s: %s",
entry.getName(), entry.getValue())).collect(Collectors.toList());
Fetch.GetResponseBodyResponse response = chromeDevTools.send(Fetch.getResponseBody(event.getRequestId()));
String body = new String(Base64.decodeBase64(response.getBody().getBytes("UTF8")));
System.err.println("response body:\n" + body);
}
});
// he mouse hover actions to follow
This allows capture every Ajax request response headers,
List<HeaderEntry> headerEntries = event.getResponseHeaders().isPresent() ? event.getResponseHeaders().get() : new ArrayList<>();
List<String> headers = headerEntries.stream().map(entry -> String.format("%s: %s", entry.getName(), entry.getValue())) .collect(Collectors.toList());
along with response status
event.getResponseStatusCode().get()
and body which is usually a base64 encoded JSON with multiple details, processed by browser
Fetch.GetResponseBodyResponse response = chromeDevTools.send(Fetch.getResponseBody(event.getRequestId()));
String body = null;
if (response.getBase64Encoded()) {
try {
body = new String( Base64.decodeBase64(response.getBody().getBytes("UTF8")));
} catch (UnsupportedEncodingException e) {
System.err.println("Exception (ignored): " + e.toString());
}
} else {
body = response.getBody();
}
finally the test continues default processing of the request:
chromeDevTools.send(Fetch.continueRequest(
event.getRequestId(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty()));
- the arguments to the Java adapter method match the Javascript
Fetch.continueResponse
parameter definition:
requestId
RequestId
An id the client received in requestPaused event.
responseCode
integer
An HTTP response code. If absent, original response code will be used.
responsePhrase
string
A textual representation of responseCode. If absent, a standard phrase matching responseCode is used.
responseHeaders
array[ HeaderEntry ]
Response headers. If absent, original response headers will be used.
binaryResponseHeaders
string
Alternative way of specifying response headers as a \0-separated series of name: value pairs. Prefer the above method unless you need to represent some non-UTF8 values that can't be transmitted over the protocol as text. (Encoded as a base64 string when passed over JSON)
Browser console logs may accessed asynchronuosly in asimilar fashion:
@Before
public void beforeTest() throws Exception {
chromeDevTools.send(Log.enable());
chromeDevTools.addListener(Log.entryAdded(),
(LogEntry event) -> System.err.println(
String.format( "time stamp: %s line number: %s url: \"%s\" text: %s",
formatTimestamp(event.getTimestamp()),
(event.getLineNumber().isPresent() ? event.getLineNumber().get() : ""),
(event.getUrl().isPresent() ? event.getUrl().get() : ""),
event.getText())));
}
The properties of the event are taken from Log entry
object specification
One can also confirm the logging event to have expected properties, e.g. message:
@Test
public void test() {
final String consoleMessage = "Lorem ipsum";
chromeDevTools.addListener(Log.entryAdded(),
(LogEntry event) -> assertThat(event.getText(), containsString(consoleMessage)));
if (driver instanceof JavascriptExecutor) {
JavascriptExecutor executor = JavascriptExecutor.class.cast(driver);
executor.executeScript("console.log(arguments[0]);", consoleMessage);
}
}
This API uses CDP command:
public void test1() {
PrintToPDFResponse response;
boolean landscape = false;
boolean displayHeaderFooter = false;
boolean printBackground = false;
Page.PrintToPDFTransferMode transferMode = Page.PrintToPDFTransferMode.RETURNASBASE64;
int scale = 1;
// Act
response = chromeDevTools.send(Page.printToPDF(
Optional.of(landscape),
Optional.of(displayHeaderFooter),
Optional.of(printBackground),
Optional.of(scale),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.of(transferMode)));
assertThat(response, notNullValue());
String body = new String(Base64.decodeBase64(response.getData().getBytes("UTF8")));
assertThat(body, notNullValue());
String magic = body.substring(0, 9);
assertThat(magic, containsString("%PDF"));
the browser needs to run headless mode for the call to succeed the alternative call signature is
response = chromeDevTools.send(new Command<PrintToPDFResponse>("Page.printToPDF", ImmutableMap.of("landscape", landscape), o -> o.read(PrintToPDFResponse.class)));
assertThat(response, notNullValue());
for some calls (but not specifically for Page.printToPDF
) yet anoher alternavie signature via static method exists
response = chromeDevTools.send(new Command<PrintToPDFResponse>("Page.printToPDF", ImmutableMap.of("landscape", landscape), ConverterFunctions.map("data", PrintToPDFResponse.class)));
in additon to legacy-like keyboard zoom, the CDP supports Page.setDeviceMetricsOverride
method and Emulation.setDeviceMetricsOverride
method:
@Before
public void before() throws Exception {
baseURL = "https://www.wikipedia.org";
driver.get(baseURL);
}
@Test
public void test1() {
for (int cnt = 0; cnt != deviceScaleFactors.length; cnt++) {
double deviceScaleFactor = deviceScaleFactors[cnt];
screenshotFileName = String.format("test1_%03d.jpg",
(int) (100 * deviceScaleFactor));
layoutMetrics = chromeDevTools.send(Page.getLayoutMetrics());
rect = layoutMetrics.getContentSize();
width = rect.getWidth().intValue();
height = rect.getHeight().intValue();
System.err.println(String.format("Content size: %dx%d", width, height));
chromeDevTools.send(
// @formatter:off
Emulation.setDeviceMetricsOverride(
rect.getWidth().intValue(),
rect.getHeight().intValue(),
deviceScaleFactor,
false,
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty(),
Optional.empty()
)
// @formatter:on
);
String dataString = chromeDevTools.send(
// @formatter:off
Page.captureScreenshot(
Optional.of(Page.CaptureScreenshotFormat.JPEG),
Optional.of(100),
Optional.empty(),
Optional.of(true),
Optional.of(true)
)
// @formatter:off
);
chromeDevTools.send(Emulation.clearDeviceMetricsOverride());
byte[] image = base64.decode(dataString);
try {
BufferedImage o = ImageIO.read(new ByteArrayInputStream(image));
System.err.println(String.format("Screenshot dimensions: %dx%d",
o.getWidth(), o.getHeight()));
assertThat((int) (width * deviceScaleFactor) - o.getWidth(),
not(greaterThan(2)));
assertThat((int) (height * deviceScaleFactor) - o.getHeight(),
not(greaterThan(2)));
} catch (IOException e) {
System.err.println("Exception loading image (ignored): " + e.toString());
}
try {
FileOutputStream fileOutputStream = new FileOutputStream(
screenshotFileName);
fileOutputStream.write(image);
fileOutputStream.close();
} catch (IOException e) {
System.err.println("Exception saving image (ignored): " + e.toString());
}
}
}
@After
public void clearPage() {
chromeDevTools.send(CSS.disable());
try {
chromeDevTools.send(DOM.disable());
} catch (DevToolsException e) {
// DOM agent hasn't been enabled
}
driver.get("about:blank");
}
this test gets gradually magnified out page screen shots:
alternatively use CDP commands for the same:
@SuppressWarnings("unchecked")
@Test
public void test() {
// Assert
params = new HashMap<>();
for (int cnt = 0; cnt != deviceScaleFactors.length; cnt++) {
double deviceScaleFactor = deviceScaleFactors[cnt];
filename = String.format("test2_%03d.jpg",
(int) (100 * deviceScaleFactor));
try {
command = "Page.getLayoutMetrics";
result = driver.executeCdpCommand(command, new HashMap<>());
System.err
.println("Page.getLayoutMetrics: " + result.get("contentSize"));
rect = (Map<String, Long>) result.get("contentSize");
height = rect.get("height");
width = rect.get("width");
command = "Emulation.setDeviceMetricsOverride";
// Act
System.err.println(String.format("Scaling to %02d%% %s",
(int) (100 * deviceScaleFactor), filename));
params.clear();
params.put("deviceScaleFactor", deviceScaleFactor);
params.put("width", width);
params.put("height", height);
params.put("mobile", false);
params.put("scale", 1);
driver.executeCdpCommand(command, params);
Utils.sleep(delay);
command = "Page.captureScreenshot";
// Act
result = driver.executeCdpCommand(command,
new HashMap<String, Object>());
command = "Emulation.clearDeviceMetricsOverride";
driver.executeCdpCommand(command, new HashMap<String, Object>());
// Assert
assertThat(result, notNullValue());
assertThat(result, hasKey("data"));
dataString = (String) result.get("data");
assertThat(dataString, notNullValue());
byte[] image = base64.decode(dataString);
BufferedImage o = ImageIO.read(new ByteArrayInputStream(image));
assertThat(o.getWidth(), greaterThan(0));
assertThat(o.getHeight(), greaterThan(0));
FileOutputStream fileOutputStream = new FileOutputStream(filename);
fileOutputStream.write(image);
fileOutputStream.close();
} catch (IOException e) {
System.err.println("Exception saving image (ignored): " + e.toString());
} catch (JsonSyntaxException e) {
System.err.println("JSON Syntax exception in " + command
+ " (ignored): " + e.toString());
} catch (WebDriverException e) {
// willbe thrown if the required arguments are not provided.
// TODO: add failing test
System.err.println(
"Web Driver exception in " + command + " (ignored): " + Utils
.processExceptionMessage(e.getMessage() + " " + e.toString()));
} catch (Exception e) {
System.err.println("Exception in " + command + " " + e.toString());
throw (new RuntimeException(e));
}
}
}
Bandwidth improving filtering of certain mask URLs
chromeDevTools.send(Network.enable(Optional.of(100000000), Optional.empty(), Optional.empty()));
chromeDevTools.send(Network.setBlockedURLs(ImmutableList.of("*.css", "*.png", "*.jpg", "*.gif", "*favicon.ico")));
driver.get("http://arngren.net");
one can also log the *.css
, *.jpg
*.png
and *.ico
blocking in action:
// verify that
chromeDevTools.addListener(Network.loadingFailed(),
(LoadingFailed event) -> {
ResourceType resourceType = event.getType();
if (resourceType.equals(ResourceType.STYLESHEET)
|| resourceType.equals(ResourceType.IMAGE)
|| resourceType.equals(ResourceType.OTHER)) {
Optional<BlockedReason> blockedReason = event.getBlockedReason();
assertThat(blockedReason.isPresent(), is(true));
assertThat(blockedReason.get(), is(BlockedReason.INSPECTOR));
}
System.err.println("Blocked event: " + event.getType());
});
finally one can disable filtering:
// set request interception only for css requests
RequestPattern requestPattern = new RequestPattern(Optional.of("*.gif"), Optional.of(ResourceType.IMAGE), Optional.of(InterceptionStage.HEADERSRECEIVED));
chromeDevTools.send(Network.setRequestInterception(ImmutableList.of(requestPattern)));
chromeDevTools.send(Page.navigate(baseURL, Optional.empty(),Optional.empty(), Optional.empty(), Optional.empty()));
One can call cdp protocol to invoke setUserAgentOverride method and dynmically modify the user-agent
header during the test:
import org.openqa.selenium.chrome.ChromeDriver;
import org.openqa.selenium.chromium.ChromiumDriver;
ChromiumDriver driver = new ChromeDriver();
driver.get("https://www.whoishostingthis.com/tools/user-agent/");
By locator = By.cssSelector(".user-agent");
WebElement element = driver.findElement(locato);
assertThat(element.getAttribute("innerText"), containsString("Mozilla"));
Map<String, Object> params = new HashMap<String, Object>();
params.put("userAgent", "python 2.7");
params.put("platform", "Windows");
driver.executeCdpCommand("Network.setUserAgentOverride", params);
driver.navigate().refresh();
sleep(100);
element = driver.findElement(locator);
assertThat(element.isDisplayed(), is(true));
assertThat(element.getAttribute("innerText"), is("python 2.7"));
demonstrates that the user-agent is indeed changing
The example shows alternative API to collect the cookies available to page Javascript
Map<String, Object> result = driver.executeCdpCommand("Page.getCookies", new HashMap<String, Object>());
ArrayList<Map<String, Object>> cookies = (ArrayList<Map<String, Object>>) result.get("cookies");
cookies.stream().limit(100).map(o -> o.keySet()).forEach(System.err::println);
String result = driver.executeCdpCommand("Page.captureScreenshot", new HashMap<>());
String data = (String) result.get("data");
byte[] image = new (Base64()).decode(data);
assertThat(ImageIO.read(new ByteArrayInputStream(image)).getWidth(), greaterThan(0));
(new FileOutputStream("temp.png")).write(image);
implements the clipping to viewport functioality
command = "Page.captureScreenshot";
params = new HashMap<String, Object>();
Map<String, Object> viewport = new HashMap<>();
System.err.println("Specified viewport: " + String
.format("x=%d, y=%d, width=%d, height=%d", x, y, width, height));
viewport.put("x", (double) x);
viewport.put("y", (double) y);
viewport.put("width", (double) width);
viewport.put("height", (double) height);
viewport.put("scale", scale);
params.put("clip", viewport);
result = driver.executeCdpCommand(command, params);
dataString = (String) result.get("data");
assertThat(dataString, notNullValue());
Base64 base64 = new Base64();
byte[] image = base64.decode(dataString);
String screenshotFileName = String.format("card%02d.png", cnt);
FileOutputStream fileOutputStream = new FileOutputStream( screenshotFileName);
fileOutputStream.write(image);
fileOutputStream.close();
Note: some CDP API notably Page.printToPDF
are not curently implemented:
unhandled inspector error: {"code":-32000,"message":"PrintToPDF is not implemented"}(..)
This can be done both at the wrapper methods
// enable Network
chromeDevTools.send(Network.enable(Optional.empty(), Optional.empty(), Optional.empty()));
headers = new HashMap<>();
headers.put("customHeaderName", "customHeaderValue");
Headers headersData = new Headers(headers);
chromeDevTools.send(Network.setExtraHTTPHeaders(headersData));
The validation can be done through hooking assert and log message to the event:
// add event listener to log that requests are sending with the custom header
chromeDevTools.addListener(Network.requestWillBeSent(),
o -> Assert.assertEquals(o.getRequest().getHeaders().get("customHeaderName"), "customHeaderValue"));
chromeDevTools.addListener(Network.requestWillBeSent(), o -> System.err.println(
"addCustomHeaders Listener invoked with " + o.getRequest().getHeaders().get("customHeaderName")));
and low level "commands":
String command = "Network.enable";
params = new HashMap<>();
params.put("maxTotalBufferSize", 0);
params.put("maxPostDataSize", 0);
params.put("maxPostDataSize", 0);
result = driver.executeCdpCommand(command, params);
command = "Network.setExtraHTTPHeaders";
params = new HashMap<>();
Map<String, String> headers = new HashMap<>();
headers.put("customHeaderName", this.getClass().getName() + " addCustomHeadersTest");
params.put("headers", headers);
result = driver.executeCdpCommand(command, params);
To test one can e.g. fire a tomcat server with request header logging and
send the GET
request
driver.get("http://127.0.0.1:8080/demo/Demo");
The actual validation will be done through console logs inspection of the server
The following somewhat long test exercises steps one has to perform with CDP to get a specific DOM Node focused and act upon:
It appears every node search starts with getting the document:
@SuppressWarnings("unchecked")
@Test
public void getDocumentTest() {
// Arrange
driver.get("https://www.google.com");
String command = "DOM.getDocument";
try {
// Act
result = driver.executeCdpCommand(command, new HashMap<>());
// Assert
assertThat(result, hasKey("root"));
Map<String, Object> data = (Map<String, Object>) result.get("root");
assertThat(data, hasKey("nodeId"));
assertTrue(Long.parseLong(data.get("nodeId").toString()) != 0);
err.println("Command " + command + " return node: "
+ new Gson().toJson(data, Map.class));
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
}
}
This test logs:
Command DOM.getDocument return node:
{
"backendNodeId": 1,
"baseURL": "https://www.google.com/",
"childNodeCount": 2,
"children": [
{
"backendNodeId": 2,
"localName": "",
"nodeId": 10,
"nodeName": "html",
"nodeType": 10,
"nodeValue": "",
"parentId": 9,
"publicId": "",
"systemId": ""
},
{
"attributes": [
"itemscope",
"",
"itemtype",
"http://schema.org/WebPage",
"lang",
"en"
],
"backendNodeId": 3,
"childNodeCount": 2,
"children": [
{
"attributes": [],
"backendNodeId": 21,
"childNodeCount": 12,
"localName": "head",
"nodeId": 12,
"nodeName": "HEAD",
"nodeType": 1,
"nodeValue": "",
"parentId": 11
},
{
"attributes": [
"jsmodel",
" ",
"class",
"hp vasq",
"id",
"gsr"
],
"backendNodeId": 22,
"childNodeCount": 8,
"localName": "body",
"nodeId": 13,
"nodeName": "BODY",
"nodeType": 1,
"nodeValue": "",
"parentId": 11
}
],
"frameId": "C3CE739B971DD10AFECA84F6C1554308",
"localName": "html",
"nodeId": 11,
"nodeName": "HTML",
"nodeType": 1,
"nodeValue": "",
"parentId": 9
}
],
"documentURL": "https://www.google.com/",
"localName": "",
"nodeId": 9,
"nodeName": "#document",
"nodeType": 9,
"nodeValue": "",
"xmlVersion": ""
}
now one can
command = "DOM.querySelector";
params.clear();
params.put("nodeId", nodeId);
params.put("selector", "img#hplogo");
try {
result = driver.executeCdpCommand(command, params);
assertThat(result, hasKey("nodeId"));
nodeId = (Long) result.get("nodeId");
assertTrue(nodeId != 0);
err.println("Command " + command + " returned nodeId: " + nodeId);
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
}
command = "DOM.getOuterHTML";
params.clear();
params.put("nodeId", nodeId);
try {
result = driver.executeCdpCommand(command, params);
assertThat(result, notNullValue());
assertThat(result, hasKey("outerHTML"));
String dataString = (String) result.get("outerHTML");
assertThat(dataString, notNullValue());
err.println("Command " + command + " return outerHTML: " + dataString);
} catch (Exception e) {
err.println("Exception in " + command + " (ignored): " + e.toString());
}
}
This will log:
Command DOM.querySelector returned nodeId: 162
Command DOM.getOuterHTML return outerHTML:
<img alt="Google" height="92" id="hplogo" src="/images/branding/googlelogo/2x/googlelogo_color_272x92dp.png" style="padding-top:109px" width="272" onload="typeof google==='object'&&google.aft&&google.aft(this)" data-iml="1576602836994" data-atf="1">
collapsing multiple command calls together will lead to somewhat bloated test method
@Test
public void multiCommandTest() {
// Arrange
baseURL = "https://www.google.com";
driver.get(baseURL);
String command = "DOM.getDocument";
try {
// Act
result = driver.executeCdpCommand(command, new HashMap<>());
// Assert
assertThat(result, hasKey("root"));
@SuppressWarnings("unchecked")
Map<String, Object> node = (Map<String, Object>) result.get("root");
assertThat(node, hasKey("nodeId"));
nodeId = Long.parseLong(node.get("nodeId").toString());
assertTrue(nodeId != 0);
err.println("Command " + command + " returned nodeId: " + nodeId);
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
}
command = "DOM.describeNode";
params = new HashMap<>();
params.put("nodeId", nodeId);
params.put("depth", 1);
try {
result = driver.executeCdpCommand(command, params);
// Assert
assertThat(result, hasKey("node"));
@SuppressWarnings("unchecked")
Map<String, Object> data = (Map<String, Object>) result.get("node");
for (String field : Arrays.asList(
new String[] { "nodeType", "nodeName", "localName", "nodeValue" })) {
assertThat(data, hasKey(field));
}
System.err.println("Command " + command + " returned node: " + data);
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
}
command = "DOM.querySelector";
params = new HashMap<>();
params.put("nodeId", nodeId);
// params.put("selector", "img#hplogo");
params.put("selector", "input[name='q']");
try {
result = driver.executeCdpCommand(command, params);
// depth, 1
// Assert
assertThat(result, hasKey("nodeId"));
// @SuppressWarnings("unchecked")
nodeId = Long.parseLong(result.get("nodeId").toString());
assertTrue(nodeId != 0);
err.println("Command " + command + " returned nodeId: " + nodeId);
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
}
command = "DOM.resolveNode";
params = new HashMap<>();
params.put("nodeId", nodeId);
try {
result = driver.executeCdpCommand(command, params);
// depth, 1
// Assert
assertThat(result, hasKey("object"));
// object
@SuppressWarnings("unchecked")
Map<String, Object> data = (Map<String, Object>) result.get("object");
for (String field : Arrays.asList(
new String[] { "type", "subtype", "className", "objectId" })) {
assertThat(data, hasKey(field));
}
String objectId = (String) data.get("objectId");
assertThat(objectId, notNullValue());
System.err
.println("Command " + command + " returned objectId: " + objectId);
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
}
command = "DOM.something not defined";
try {
// Act
result = driver.executeCdpCommand(command, new HashMap<>());
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
// wasn't found
}
// DOM.removeNode
command = "DOM.focus";
params = new HashMap<>();
params.put("nodeId", nodeId);
try {
// Act
result = driver.executeCdpCommand(command, params);
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
// : unknown error: unhandled inspector error:
// {"code":-32000,"message":"Element is not focusable"}
}
command = "DOM.highlightNode";
try {
// Act
result = driver.executeCdpCommand(command, new HashMap<>());
Utils.sleep(10000);
} catch (org.openqa.selenium.WebDriverException e) {
err.println(
"Exception in command " + command + " (ignored): " + e.toString());
}
// TODO: command = "Runtime.callFunctionOn";
}
The selenium-chromium-driver that is only available for Selenum release 4 is the critical dependency jar of this project. The selenium-chromium-driver repository search page.
The devtools and chromium subprojects of selenium client of official seleniumhq/selenium project have no dependencies and can be cloned and built locally allowing one to use CDP API with Selenium 3.x e.g. Selenium 3.13.0. This is currently attempted this way in this project. Moving away form default 4.0.0.alpha maven profiles is a work in progress.
With Selenium driver release 4.0.0-alpha-7 just to make the project compile changes imported package names need to change all
org.openqa.selenium.devtools.browser
references with org.openqa.selenium.devtools.v87.browser
and similar to other packages inside org.openqa.selenium.devtools
were requied. Without this multiple compile errors like:
package org.openqa.selenium.devtools.browser does not exist
are observed
Also the following run time errors indicate that selenium-api-4.0.0-alpha-7.jar
was build on JDK 11 and is notloadable in JDK 8.
This manifests through the runtime exception
java.lang.NoClassDefFoundError: Could not initialize class org.openqa.selenium.net.PortProber
at org.openqa.selenium.remote.service.DriverService$Builder.build(DriverService.java:401)
at org.openqa.selenium.chrome.ChromeDriverService.createServiceWithConfig(ChromeDriverService.java:133)
the usual classpath scan reveals the jar containing the class in question, to be actually present in classpath
find ~/.m2/repository/ -iname 'selenium*jar' |xargs -IX sh -c "echo X; jar tvf X" | tee a
and method signature exception
java.lang.NoSuchMethodError: java.io.FileReader.<init>(Ljava/io/File;Ljava/nio/charset/Charset;)V
at org.openqa.selenium.net.LinuxEphemeralPortRangeDetector.getInstance(LinuxEphemeralPortRangeDetector.java:36)
at org.openqa.selenium.net.PortProber.<clinit>(PortProber.java:42)
the method the exception is complainign was added in Java 11
- To get Google Chrome updates past version 108, one needs Windows 10 or later. Some development environment computers are using Windows 8.1
- When Chromium browser installed via snapd on Ubuntu 20.04, all tests are failing with
org.openqa.selenium.SessionNotCreatedException: Could not start a new session. Response code 500. Message: unknown error: DevToolsActivePort file doesn't exist
Host info: host: 'lenovoy40-1', ip: '127.0.1.1'
Build info: version: '4.10.0', revision: 'c14d967899'
System info: os.name: 'Linux', os.arch: 'amd64', os.version: '5.4.0-150-generic', java.version: '1.8.0_161'
Driver info: org.openqa.selenium.chrome.ChromeDriver
Command: [null, newSession {capabilities=[Capabilities {browserName: chrome, goog:chromeOptions: {args: [--remote-allow-origins=*, --allow-insecure-localhost, --allow-running-insecure-co..., --browser.download.folderLi..., --browser.helperApps.neverA..., --disable-blink-features=Au..., --disable-default-app, --disable-dev-shm-usage, --disable-extensions, --disable-gpu, --disable-infobars, --disable-in-process-stack-..., --disable-logging, --disable-notifications, --disable-popup-blocking, --disable-save-password-bubble, --disable-translate, --disable-web-security, --enable-local-file-accesses, --ignore-certificate-errors, --ignore-certificate-errors, --ignore-ssl-errors=true, --log-level=3, --no-proxy-server, --no-sandbox, --output=/dev/null, --ssl-protocol=any, --user-agent=Mozilla/5.0 (W...], extensions: []}}]}]
at org.openqa.selenium.remote.ProtocolHandshake.createSession(ProtocolHandshake.java:140)
at org.openqa.selenium.remote.ProtocolHandshake.createSession(ProtocolHandshake.java:96)
at org.openqa.selenium.remote.ProtocolHandshake.createSession(ProtocolHandshake.java:68)
at org.openqa.selenium.remote.HttpCommandExecutor.execute(HttpCommandExecutor.java:163)
at org.openqa.selenium.remote.service.DriverCommandExecutor.invokeExecute(DriverCommandExecutor.java:196)
at org.openqa.selenium.remote.service.DriverCommandExecutor.execute(DriverCommandExecutor.java:171)
at org.openqa.selenium.remote.RemoteWebDriver.execute(RemoteWebDriver.java:531)
at org.openqa.selenium.remote.RemoteWebDriver.startSession(RemoteWebDriver.java:227)
at org.openqa.selenium.remote.RemoteWebDriver.<init>(RemoteWebDriver.java:154)
at org.openqa.selenium.chromium.ChromiumDriver.<init>(ChromiumDriver.java:107)
at org.openqa.selenium.chrome.ChromeDriver.<init>(ChromeDriver.java:87)
at org.openqa.selenium.chrome.ChromeDriver.<init>(ChromeDriver.java:82)
at org.openqa.selenium.chrome.ChromeDriver.<init>(ChromeDriver.java:71)
at com.github.sergueik.selenium.BaseCdpTest.beforeClass(BaseCdpTest.java:124)
- when Chromium browser installed via apt, from
sudo add-apt-repository ppa:system76/pop
sudo apt update
sudo apt install -y -q chromium
the tests would work but the browser is quite old version 83.
The latest available version of chromium-broser on http://archive.ubuntu.com/ubuntu/pool/universe/c/chromium-browser/
is 112
NOTE: will have to download a few packages to be able to install chromium-browser
:
chromium-browser_112.0.5615.49-0ubuntu0.18.04.1_amd64.deb
chromium-browser-l10n_112.0.5615.49-0ubuntu0.18.04.1_all.deb
chromium-codecs-ffmpeg_112.0.5615.49-0ubuntu0.18.04.1_amd64.deb
chromium-codecs-ffmpeg-extra_112.0.5615.49-0ubuntu0.18.04.1_amd64.deb
and install them in specific order:
dpkg -i chromium-codecs-ffmpeg_112.0.5615.49-0ubuntu0.18.04.1_amd64.deb
dpkg -i chromium-codecs-ffmpeg-extra_112.0.5615.49-0ubuntu0.18.04.1_amd64.deb
dpkg -i chromium-browser_112.0.5615.49-0ubuntu0.18.04.1_amd64.deb
Since the version mismatch the test log will contain plenty of
Match
WARNING: Unable to find CDP implementation matching 112
Jun 15, 2023 9:56:53 AM org.openqa.selenium.chromium.ChromiumDriver lambda$new$4
WARNING: Unable to find version of CDP to use for . You may need to include a dependency on a specific version of the CDP using something similar to `org.seleniumhq.selenium:selenium-devtools-v86:4.10.0` where the version ("v86") matches the version of the chromium-based browser you're using and the version number of the artifact is the same as Selenium's.
- when Chrome latest stable deb package is downloaded and chrome is installed via
dpkg
cd ~/Downloads
wget https://dl.google.com/linux/direct/google-chrome-stable_current_amd64.deb
sudo apt install -y -q ./google-chrome-stable_current_amd64.deb
a longer version
cd ~/Downloads
wget -nv "https://dl.google.com/linux/direct/google-chrome-stable_current_amd64.deb"
sudo apt-get install -qqy libxss1 libappindicator1 libindicator7
sudo dpkg -i google-chrome-stable_current_amd64.deb
rm google-chrome-stable_current_amd64.deb
sudo apt-get install -qqy -f google-chrome-stable
dpkg -l google-chrome-stable
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)
||/ Name Version Architecture Description
+++-==============-============-============-=================================
ii google-chrome- 114.0.5735.1 amd64 The web browser from Google
Download latest Chromedriver
CHROMEDRIVER_VERSION=$(curl -s "http://chromedriver.storage.googleapis.com/LATEST_RELEASE")
PACKAGE_ARCHIVE='chromedriver_linux64.zip'
PLATFORM=linux64
URL="http://chromedriver.storage.googleapis.com/${CHROMEDRIVER_VERSION}/chromedriver_${PLATFORM}.zip"
wget -O $PACKAGE_ARCHIVE -nv $URL
unzip -o $PACKAGE_ARCHIVE
NOTE: the latest version of Chrome Driver that can be downloaded this way is 114.0.5735.90
the tests work
NOTE: is is better to use
dpkg -i ./google-chrome-stable_current_amd64.deb
the apt
has the warning:
W: Repository is broken: google-chrome-stable:amd64 (= 114.0.5735.133-1) has no Size information
one may configure
``
and subscribe to events Browser.downloadWillBegin
and `Browser.downloadProgress` via CDP:
chromeDevTools.addListener(Browser.downloadWillBegin(),
(DownloadWillBegin o) -> {
System.err.println("in Browser.downloadWillBegin listener. url: " + o.getUrl() + "\tfilename: " + o.getSuggestedFilename());
});
List<DownloadProgress.State> states = new ArrayList<>();
chromeDevTools.addListener(Browser.downloadProgress(),
(DownloadProgress o) -> {
DownloadProgress.State state = o.getState();
System.err.println("in Browser.downloadProgress listener. state: " + state.toString());
states.add(state);
});
in Browser.downloadWillBegin listener. url: https://scholar.harvard.edu/files/torman_personal/files/samplepptx.pptx filename: samplepptx.pptx
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: inProgress
in Browser.downloadProgress listener. state: completed
in Browser.downloadProgress listener. state: completed
Inspecting downloaded filename: f5a83cbd-97fb-451e-8651-618f63c1ec59
Verified downloaded file: f5a83cbd-97fb-451e-8651-618f63c1ec59 in /tmp
Starting with version 115 the Chrome browser and ChromeDriver
information is located on
Chrome for Testing availability
dashboard page.
A broader listing of Chrome versions can be found in https://googlechromelabs.github.io/chrome-for-testing/known-good-versions-with-downloads.json
:
curl -k -O https://googlechromelabs.github.io/chrome-for-testing/known-good-versions-with-downloads.json
jq '.versions[] | select(.version | contains( "114.")) ' known-good-versions-with-downloads.json | jq '.downloads[]|.[]|select(.platform |contains("linux64"))'
{
"platform": "linux64",
"url": "https://edgedl.me.gvt1.com/edgedl/chrome/chrome-for-testing/114.0.5696.0/linux64/chrome-linux64.zip"
}
- extract the
chrome
andchromedriver
links viajq
(unfinished for chromedriver, save the full JSON locally and finish the query)
The instructions of version lookup are provided on version selection hint page. Prior to that the chromedriver
download links were posted on Chromedriver Downloads page
- With Selenium version 4.14.0 onwards one has to build it on JDK 11_ or later, even if targeting Java 1.8 in
pom.xml
-* Require Java 11 (#12843)
is noted in the Selenium Changelog.
The attempt to build with JDK 1.8 fails with
[ERROR] bad class file: .m2\repository\org\seleniumhq\selenium\selenium-api\4.14.0\selenium-api-4.14.0.jar
[ERROR] class file has wrong version 55.0, should be 52.0
if seeing the version mismatch error in every test:
mvn test
org.openqa.selenium.SessionNotCreatedException: Could not start a new session. Response code 500. Message: session not created: This version of ChromeDriver only supports Chrome version 122
Current browser version is 121.0.6167.85 with binary path /opt/google/chrome/chrome
make sure to have the google repository added:
echo "deb [arch=amd64] http://dl.google.com/linux/chrome/deb/ stable main" | sudo tee /etc/apt/sources.list.d/google-chrome.list
and the error is cleared:
apt-get install google-chrome-stable
Reading package lists... Done
Building dependency tree
Reading state information... Done
Package google-chrome-stable is not available, but is referred to by another package.
This may mean that the package is missing, has been obsoleted, or
is only available from another source
E: Package 'google-chrome-stable' has no installation candidate
dpkg-reconfigure google-chrome-stable
/usr/sbin/dpkg-reconfigure: google-chrome-stable is broken or not fully installed
the install the chrome:
apt-get install google-chrome-stable
and confirm the test to pass
A textbook File Upload Form looks like below
<html>
<head>
<title>File Upload Test</title>
</head>
<body>
<h1>File Upload Test</h1>
<form enctype = "multipart/form-data" action="upload endpoint url" method="POST">
upload file path: <input name="upload file path" type="file">
<input type="submit" value="send file">
</form>
</body>
The requirements for plain browser-driven file upload HTML page are:
- form must specify the
POST
method - form must specify an enctype of
multipart/form-data
- form must contain an
<input type="file">
element
To examine the upload request genetated by the browser, subscribe to `` event:
Map<String, Map<String, Object>> requests = new HashMap<>();
chromeDevTools.send(Network.enable(Optional.empty(), Optional.empty(), Optional.empty()));
chromeDevTools.addListener(Network.requestWillBeSent(),
(RequestWillBeSent event) ->
requests.put(event.getRequest().getUrl(), event.getRequest().getHeaders().toJson())
);
and perform the upload:
wait = new WebDriverWait(driver, Duration.ofSeconds(flexibleWait));
wait.pollingEvery(Duration.ofMillis(pollingInterval));
actions = new Actions(driver);
url = "https://ps.uci.edu/~franklin/doc/file_upload.html";
driver.get(url);
element = wait.until(ExpectedConditions.visibilityOfElementLocated(By.cssSelector("input[name='userfile']")));
assertThat(element.isDisplayed(), is(true));
Utils.highlight(element);
element.sendKeys(dummy.getAbsolutePath());
element = driver.findElement(By.tagName("form"));
assertThat(element, notNullValue());
assertThat(element.getAttribute("action"), notNullValue());
url2 = element.getAttribute("action");
element = driver.findElement(By.cssSelector("input[type='submit']"));
assertThat(element, notNullValue());
Utils.highlight(element);
requests.clear();
element.submit();
try {
Thread.sleep(1000);
} catch (InterruptedException e) {
e.printStackTrace();
}
System.err.println("Captured: ");
requests.keySet().stream().forEach(System.err::println);
this will print:



https://ps.uci.edu/favicon.ico
https://www.oac.uci.edu/indiv/franklin/cgi-bin/values
the following confirms there is more than one data chunk:
Pattern pattern = Pattern.compile("data:image/png;base64");
System.err.println("Pattern:\n" + pattern.toString());
cnt = 0;
for (String x : requests.keySet()) {
Matcher matcher = pattern.matcher(x);
if (matcher.find()) {
cnt++;
}
}
assertThat(cnt, greaterThan(1));
the following prints the headers sent to file upload endpoint:
System.err.println("Headers: " + requests.get(url2).toString());
Headers: {
Content-Type=multipart/form-data;
boundary=----WebKitFormBoundaryhsNWaugzUXymNUzV,
Origin=https://ps.uci.edu,
Referer=https://ps.uci.edu/,
Upgrade-Insecure-Requests=1,
User-Agent=Mozilla/5.0 (Windows NT 6.1; WOW64; rv:33.0) Gecko/20120101 Firefox/33.0,
sec-ch-ua="Chromium";v="124",
"Google Chrome";v="124",
"Not-A.Brand";v="99",
sec-ch-ua-mobile=?0,
sec-ch-ua-platform="Linux"
}
-
chrome devtools project
-
standalond java cdp client (commecial)
-
headless chrome devtools based testing lecture and video(in Russan)
-
HTML to PDF conversion with Chromium devtools and Selenium Python client (in Russan)
-
Selenium 4 Relative Locator DOM traversal DSL.
-
Selenium 4 Relatve locator examples
-
alternative java websocket client HubSpot/ChromeDevToolsClient for the Chrome DevTools Protocol
-
another chrome-devtools-java-client project featuring annotation-style builder pattern design for handling chrome commandline arguments.
-
examples with callback hook example
-
yet another project chrome-devtools-webdriver-integration
-
yet another framework project sachinguptait/SeleniumAutomation demonstrating Selenium 4 and CDP features
-
yet another project SrinivasanTarget/selenium4CDPsamples
-
Python Chrome Devtools Procotol client
-
yet another CDP Java client . Note: only works with Selenium 4.alpha-2
-
Python Chrome Devtools Procotol client
-
Selenium 3.x CDP Exender clone project - the original project sahajamit/chrome-devtools-webdriver-integration - is somewhat stale
-
Selenium 4.0x
WindowType
feature -
use CDP to switch to winows stackoverflow
-
JavaScript-style event callback design jpuppeteer
-
library inspired by Puppeteer to facilitate the use of Chrome DevTools API to control Chrome or Chromium via Java
-
overview of DevTools access offered by Selenium 4
-
Libraries.io - monitors over 2 million open source libraries/packages from 36 package managers
-
https://medium.com/codex/selenium4-a-peek-into-chrome-devtools-92bca6de55e0
-
BiDirectional WebDriver Protocol w3c spec
-
BiDi - The future of cross-browser automation blog
-
BiDirectional functionality official documentation
-
list of bidi APIs examples
-
translation of the guide To Java 8 Optional(in Russian)
-
Ferrum - Ruby gem for "high-level" API for CDP backed Chrome browser automation rubycdp/ferrum repository, rubygems.org link documentation translation (in Russian)
-
serg-ty/selenium-tests-logger project to enable listeners as part of the logging
-
getting connection information and cookies from chrome dev tools and use with curl
This project is licensed under the terms of the MIT license.